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Like other Schaum’s Outlines, this book is intended to be used primarily for self study. It is 
suitable as a study guide in a course on data structures using the Java programming language. In 
American universities, this is typically the second course in the computer science major. The 
book is also serves well as a reference on data structures and the Java Collections Framework. 

The book includes more than 200 detailed examples and over 260 solved problems. The 
author firmly believes that programming is learned best by practice, following a well-constructed 
collection of examples with complete explanations. This book is designed to provide that 
support. 

This second edition is a major improvement over the original 2001 edition. Most of the 
chapters have been completely rewritten. Three entirely new chapters have been added, on 
object-oriented programming, linked structures, and the Java Collections Framework. 

Java 6.0 is used throughout the book, with special attention to these new features of the 
language: 

¢ The Scanner class. 

¢ The StringBuilder class. 

¢ Formatted output, including the printf(Q method. 

¢ The enhanced for loop (also called the for-each loop). 

¢ Static imports. 

* enum types. 

¢ Variable length parameter lists. 

¢« Autoboxing. 

* Generic classes 

¢ The Deque, ArrayDeque, EnumSet, and EnumMap classes, and the Queue interface 
in the Java Collections Framework. 

Source code for all the examples, solved problems, and supplementary programming 
problems may be downloaded from the author’s Web site 

http://www.mathcs. richmond. edu/~hubbard/books/ 

I wish to thank all my friends, colleagues, students, and the McGraw-Hill staff who have 
helped me with the critical review of this manuscript, including Stephan Chipilov and Sheena 
Walker. Special thanks to my colleague Anita Huray Hubbard for her advice, encouragement, 
and supply of creative problems for this book. 


JOHN R. HUBBARD 
Richmond, Virginia 
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Object-Oriented Programming 


SOFTWARE DESIGN AND DEVELOPMENT 


Successful computer software is produced in a sequence of 
stages that are typically managed by separate teams of develop- 
ers. These stages are illustrated in Figure 1.1. 

The first stage is a recognition of the problem to be solved. In 
a corporate setting, this determination could come from market 
research. 

The second stage, which might be omitted as a formal 
process, is a study of whether the project is feasible. For 
example, do the development tools exist to produce the 
software? 

In the third stage, a document is typically produced that 
specifies precisely what the software should do. This require- 
ments document should have enough detail to be used as a 
standard when the completed software is tested. 

In the fourth stage, a thorough analysis is done before any 
effort or resources are spent designing and implementing the 
project. This could include a survey of comparable software 
already available and a cost-benefit analysis of the value of 
spending the anticipated resources. 

Once a decision has been made to proceed, the software 
design team works from the requirements document to design 
the software. This includes the specification of all the software 
components and their interrelationships. It may also require the 
specification of specialized algorithms that would be imple- 
mented in the software. 

The implementation consists of programmers coding the 
design to produce the software. 

The testing team attempts to ensure that the resulting 
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Feasibility 


y 
Requirements 
y 
Analysis 
a 
Design 
Implementation 


Testing 
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Solution 


Figure 1.1 Software life cycle 


software satisfies the requirements document. Failure at this point may require a redesign or even 
some fine-tuning of the requirements. Those eventualities are represented by the two feedback 


loops shown in Figure 1.1. 
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Testing occurs at several levels. Individual classes and methods have to be tested separately, 
and then their success at working together must be verified. Finally, the product as a whole is 
tested against the requirements document. 

One final aspect of software development that is not shown in the figure is the maintenance 
process. After the software has been delivered, its developers remain obliged to maintain it with 
corrected versions, service packages, and even major revisions. Any major revision itself would 
follow the same life cycle steps. 


OBJECT-ORIENTED DESIGN 


One common approach to software design is a top-down design strategy that gradually breaks 
the problem down into smaller parts. This is also called step-wise refinement. It focuses on the 
functional aspects of the problem and the implementation of algorithms. This procedure-oriented 
design is common in scientific programming. 

In contrast, object-oriented design focuses on the data components of the software, organizing 
the design around their representatives. For example, an air traffic control system might be 
designed in terms of airplanes, airports, pilots, controllers, and other “objects.” 

The Java programming language is particularly well-suited for implementing object-oriented 
designs. All executable code in Java is organized into classes that represent objects. For this 
reason, Java is regarded as an object-oriented programming language. 

An object is a software unit that is produced according to a unique class specification. It is 
called an instance of its class, and the process of creating it is called instantiating the class. For 
example, this code instantiates the java.util.Date class: 

java.util.Date today = new 


java.util.DateQ; ee 
The variable today is a reference to the object, as shown in 
Figure 1.2. Ignoring the distinction between a reference and java.util.Date 
the object to which it refers, we would also say today is the Figure 1.2 A Java object 


name of the java.util.Date object. 
A Java class consists of three kinds of members: fields, methods, and constructors. The fields 
hold the data for class objects, the methods hold the statements that are executed by the objects, 
and the constructors hold the code that initializes the objects’ fields. 
An object-oriented design specifies the classes that will be 
instantiated in the software. That design can be facilitated and 


Person 


illustrated by the Unified Modeling Language (UML). In Eonar Sa 
UML, each class is represented by a rectangle with separate # sex: char 
# dob: java.util.Date 


parts for listing the class’s name, its fields, and its methods and 
constructors. 

Figure 1.3 shows a UML diagram for a Person class with 
four fields (name, id, sex, and dob), a constructor, and three 
methods (isAnAdult(Q), setDob(), and toString()). Each 
of the eight class members is prefaced with a visibility symbol: Figure 1.3 A UML diagram 

+ means public 
# for protected 
- for private 
(Package visibility has no UML symbol.) 


Person(String, int, char) 
isAnAdult(): boolean 
setDob(java.util.Date) 
toString(): String 


tet 
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UML diagrams are independent of any implementing programming language. They are used 
in object-oriented design to specify objects. They should be easy to implement in Java, C++, or 
any other object-oriented programming language. They provide a kind of pseudo-code for 
classes. They specify the state (i.e., fields) and the behavior (i.e., methods) of an object without 
specifying how that behavior is accomplished. UML diagrams include no executable code. 


Specifying what an object can do without specifying how it does it is an abstraction. It allows 
the design stage to be separated from the implementation stage of the software development. It 
also facilitates modification of the software by allowing an implementation to be changed 
without affecting dependent modules. As long as a method’s behavior is unchanged, any invok- 
ing modules will be unaffected by a change in that method’s implementation. 


For example, suppose that an airline reservation system uses the Person class specified by the 
UML diagram in Figure 1.3. Presumably, that software will invoke that class’s isAnAdu1lt(Q) 
method in various modules of the system. The “contract” specified by the software design only 
requires that the method return the right answer: x.isAnAdu1t() should be true if and only if x 
is an adult. How it computes that result is irrelevant. The implementation probably computes the 
chronological difference between the value of the private field x.dob and the value of the 
current date. But there is nothing in the contract that specifies that. Moreover, if the implementa- 
tion is changed, none of the other code in the reservation system would be affected by that 
change. Such a change might be warranted by the preference of a different algorithm for comput- 
ing chronological differences, or possibly by a redefinition of the meaning of “adult.” 


Concealing the implementation of a method from the clients who use the method is called 
information hiding. It is the software designer’s version of the spy’s principle that says, “If you 
don’t need to know it, then you’re are not allowed to know it.” It makes software easier to 
design, implement, and modify. 


ABSTRACT DATA TYPES 


Abstractions are used to help understand complex systems. Even though they are different, 
rocks and tennis balls fall at the same rate. The physicist uses the abstraction of imagining a 
single imaginary point mass to understand the physics of falling bodies. By ignoring the irrele- 
vancies (diameter, weight), the abstraction allows the analyst to focus on the relevancies (height). 


Abstractions are widely used in software development. UML diagrams provide abstractions 
by focusing on the fields (the state) and methods (the behavior) of a class. But at some levels, 
even the fields of a class may be irrelevant. 


An abstract data type (ADT) is a specification of only the behavior of instances of that type. 
Such a specification may be all that is needed to design a module that uses the type. 


Primitive types are like ADTs. We know what the int type can do (add, subtract, multiply, 
etc.). But we need not know how it does these operations. And we need not even know how an 
int is actually stored. As clients, we can use the int operations without having to know how 
they are implemented. In fact, if we had to think about how they are implemented, it would 
probably be a distraction from designing the software that will use them. Likewise, if you had to 
think about how your car manages to turn its front wheels when you turn the steering wheel, it 
would probably be more difficult to drive! 
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EXAMPLE 1.1 An ADT for Fractions 


Most programming languages have types for integers and real (decimal) numbers, but not for fractions. 
Such numbers can be implemented as objects. Here is a design for a fraction type: 
ADT: Fraction 
plus(Fraction): Fraction 
times(Integer): Fraction 
times(Fraction): Fraction 
reciprocal(): Fraction 
value(): Real 
This ADT specifies five operations. Note that the times() operation is overloaded. 
Note that the ADT uses generic terms for types: Integer instead of int, and Real instead of doub]e. 
That is because it is supposed to be independent of any specific programming language. 
In general, a complete ADT would also include documentation that explains exactly how each opera- 
tion should behave. For example, 
.plus(y) returns the Fraction that represents x + y 
.times(n) returns the Fraction that represents n*x 
.times(y) returns the Fraction that represents x*y 
.reciprocal() returns the Fraction that represents 1/x 
.value() returns the numerical value of x 


x 


x x x xX 


UML diagrams can be used to specify ADTs simply by 
omitting the state information. The Fraction ADT defined in 
Example 1.1 is shown as a UML diagram in Figure 1.4. 

ADTs can be used in pseudocode to implement algorithms 
independently of any specific programming language. This is 
illustrated in Example 1.2. 


Fraction 


plus(Fraction): Fraction 
times(Integer): Fraction 
times(Fraction): Fraction 
reciprocal(): Fraction 
value(): Real 


t+etett 


Figure 1.4 An ADT in UML 
EXAMPLE 1.2 Using an ADT in an Algorithm 


The harmonic mean of two numbers x and y is the number / defined by the formula / = 2/(1/x + 1/y). In 


pseudocode for Fraction types, this could be expressed as: 
harmonicMean(x: Fraction, y: Fraction) returns Fraction 
return x.reciprocal().plus(y.reciprocal()).reciprocal().times(2) ; 


JAVA INTERFACES 


In Java, an ADT can be represented by an interface. Recall that a Java interface is just like a 
Java class, except that it contains no executable code. 


EXAMPLE 1.3 A Fraction Interface 


1 public interface Fraction { 

2 Fraction plus(Fraction x); 
3 Fraction times(int n); 

4 Fraction times(Fraction x); 
5 Fraction reciprocal(); 

6 double value(); 
7 } 


This is a direct translation into Java of the ADT specified in Example 1.1. 


If an ADT is translated into Java as an interface, then we can implement algorithms that use it 
as Java methods. 
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EXAMPLE 1.4 A harmonicMean() Method 


1 public Fraction harmonicMean(Fraction x, Fraction y) { 
2 return x.reciprocal(.plus(y.reciprocal()).reciprocal().times(2); 


3 } 
Although the Java code in Example 1.4 cannot be executed, we can compile it. 


In Java, an interface is a type. Reference variables may be declared to have an interface type, 
even if the interface has no implementation. For example, the parameters x and y at line 1| of 
Example 1.4 are declared to have type Fraction. 

An interface may represent an ADT, as in Example 1.3. More generally, a Java interface is 
meant to identify a capability. For example, the Comparab1e interface requires the implementa- 
tion of this method: 

int compareTo(CT type) 
This means that any variable declared to have type Comparable can invoke this method, 
meaning that it is capable of being compared to other objects. 


CLASSES AND OBJECTS 


Java is a strongly typed language: Every variable must be declared to have a data type. The 
various Java types are shown in Figure 1.5. These are categorized as either primitive types or 
reference types. The eight built-in primitive types are for integers, characters, decimal numbers, 
and boolean values. Reference types are user-defined, and their variables must be instantiated to 
hold data. Arrays are reviewed in Chapter 2; interfaces are types that cannot be instantiated; 
enum types are defined by listing all the values that a variable of that type may have. 

Classes are concrete data types that specify how their state is stored (the class fields) and how 
their instances behave (the instance methods). A class is defined in a declaration statement with 
this syntax: 

modifers class class-name associations { 
declarations 


} 


where modi fers are keywords such as public and abstract, class-name is an identifier such 
as Person that names the class, associations are clauses such as extends Object, and 
declarations are declarations of the class’s members. 
A class can have six kinds of members: 
1. Fields that specify the kind of data that the objects hold. 


2. Constructors that specify how the objects are to be created. 

3. Methods that specify the operations that the objects can perform. 
4. Nested classes. 

5. Interfaces. 


6. Enum type definitions. 

Each member of a class must be specified in its own declaration statement. The purpose of a 
declaration is to introduce an identifier to the compiler. It provides all the information that the 
compiler needs in order to compile statements that use that identifier. 

A field is a variable that holds data for the object. The simplest kind of field declaration has 
this syntax: 

modifers type name = initializer; 
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where modifers and the initializer are 
optional. For example, the Point class in 
Example 1.5 declares two fields at lines 2-3. 
Each has the modifier protected, which means 
that they are accessible only from within the 
class itself and from its extensions. 
A constructor is a subprogram that creates an 
object. It’s like a method with these distinctions: 
¢ Its name is the same as its class 
name. 
¢ It has no return type. 
¢ It is invoked by the new operator. 
The simplest kind of constructor declaration 
has this syntax: 
modifers name(param-decls) { 
statements 


i 


Note that a class need not have a main() 
method. If it does, it is then an executable 
program. Otherwise, it merely defines a new 
type that can be used elsewhere. 


EXAMPLE 1.5 A Ratio Class 


Java data types 
-— Primitive types 
-— Numeric types 
-— Integer types 
-— Integers 
-— byte 
-— short 
t+— int 
— long 
‘— Characters 
—— byte 
‘— Floating point types 
t— float 
— double 
‘— Boolean type 

\— boolean 
‘— Reference types 
-— Arrays 
t— Interfaces 
i— Classes 
‘— Enums 


Figure 1.5 Java types 


1 public class Ratio { 

2 protected int num; 

3 protected int den; 

4 public static final Ratio ZERO = new RatioQ; 
5 

6 private RatioQ) { 

7 thisco, 1); 

8 } 

9 

10 public RatioCint num, int den) { 

1 this.num = num; 

12 this.den = den; 

13 } 

14 

15 public boolean equals(Object object) f{ 

16 if (object==this) { 

17 return true; 

18 } else if C!Cobject instanceof Ratio)) { 
19 return false; 

20 } 

21 Ratio that = (Ratio)object; 

22 return (this.num*that.den == that.num*this.den) ; 
23 } 

24 

25 public int getNum() { 

26 return num; 


27 } 


CHAP. 1] OBJECT-ORIENTED PROGRAMMING 7 


29 public int getDen() { 

30 return den; 

31 } 

32 

33 public String toString( { 
34 return String.format("%d/%d", num, den); 
35 } 

36 

37 public double value() { 

38 return (double)num/den; 
39 } 

40 } 


Instances of this class represent fractions, with numerator (num) and denominator (den). The static 
final field ZERO represents the fraction 0/1. It is defined to be static because it is unique to the class 
itself — there shouldn’t be more than one ZERO object. 

In addition to its three fields, this class has two constructors and four methods. The no-arg constructor 
(it has no arguments) defined at line 6 is declared private so that it cannot be invoked from outside of its 
class. It is invoked at line 4 to initialize the ZERO object. This constructor uses the this keyword at line 7 to 
invoke the two-arg constructor, passing 0 to num and 1 to den. 

The two-arg constructor at line 10 is provided to the public for constructing Ratio objects with specific 
num and den values. Note that, to prevent the ZERO object from being duplicated, we could have included 
this at linel1: 

if Cnum == 0) { 
throw new Il]llegalArgumentException("Use Ratio.ZERO") ; 


t 

But then we would have to replace line 7 with explicit initializations: 
num = 0; 
den = 1; 


instead of invoking the two-arg constructor there. 

The equals() method at line 15 overrides the default equalsQ method that is defined in the Object 
class (which all other classes extend). Its purpose is to return true if and only if its explicit argument 
(object) represents the same thing as its implicit argument (this). It returns true immediately (at line 
17) if the two objects are merely different names for the same object. On the other hand, it returns false 
(at line 19) if the explicit argument is not even the right type. These tests for the two extremes are canoni- 
cal and should be done first in every equals() method. If they both are passed, then we can recast the 
explicit argument as an object of the same type as the implicit argument (Ratio) so we can access its 
fields (num and den). The test for equality of two ratios a/b = c/d is whether a*d = b*c, which is done at 
line 22. 

The methods defined at lines 25 and 29 are accessor methods (also called “getter methods”) providing 
public access to the class’s private fields. 

The toString() method at line 33 also overrides the corresponding method that is defined in the 
Object class. Its purpose is to return a String representation of its implicit argument. It is invoked 
automatically whenever a reference to an instance of the class appears in an expression where a String 
object is expected. For example, at line 6 in the test program below, the expression "x = " + x concate- 
nates the string "x = " with the reference x. That reference is replaced by the string "22/7" that is 
returned by an implicit invocation of the toString() method. 

Finally, the valueQ) method at line 37 returns a decimal approximation of the numerical value of the 
ratio. For example, at line 7 in the test program below, x.value() returns the double value 
3.142857142857143. 
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The program tests the Ratio class: 
1 public class TestRatio { 
2 public static void main(String[] args) { 
3 System.out.printInC"Ratio.ZERO = " + Ratio.ZERO); 
4 System.out.printInC"Ratio.ZERO.value() = " + Ratio.ZERO.valueQ); 
5 Ratio x = new Ratio(22, 7); 
6 System.out.printIn("x = " + x); 
7 System.out.printIn("x.valueQ) = " + x.valueQ); 
8 System.out.printIn("x.equals(Ratio.ZERO): " + x.equals(Ratio.ZERO)) ; 
9 Ratio xx = new Ratio(44, 14); 
10 System.out.printIn("xx = " + xx); 
1 System.out.printIn("xx.valueQ) = " + xx.value()); 
12 System.out.printIn("x.equals(xx): " + x.equals(xx)); 
13 } 
14 } 
The output is: 
Ratio.ZERO = 0/1 
Ratio.ZERO.valueQ = 0.0 
xX = 22/7 
x.valueQ) = 3.142857142857143 
x.equals(Ratio.ZERO): false 
xx = 44/14 
xx.value() = 3.142857142857143 
x.equals(xx): true 
The Ratio class in Example 1.5 is immutable: its fields cannot be changed. 
MODIFIERS 


Modifiers are used in the declaration of class members and local variables. These are summa- 
rized in the following tables. 


Modifier 


Meaning 


abstract 
final 


public 


strictfp 


The class cannot be instantiated. 
The class cannot be extended. 


Its members can be accessed from any other class. 


Floating-point results will be platform-independent. 


Table 1.1 Modifiers for classes, interfaces, and enums 


Modifier 


Meaning 


private 
protected 


public 


It is accessible only from within its own class. 
It is accessible only from within its own class and its extensions. 


It is accessible from all classes. 


Table 1.2 Constructor modifiers 
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Modifier Meaning 
final It must be initialized and cannot be changed. 
private It is accessible only from within its own class. 
protected It is accessible only from within its own class and its extensions. 
public It is accessible from all classes. 
static The same storage is used for all instances of the class. 
transient It is not part of the persistent state of an object. 
volatile It may be modified by asynchronous threads. 


Table 1.3 Field modifiers 


Modifier Meaning 
abstract Its body is absent; to be defined in a subclass. 
final It cannot be overridden in class extensions. 
native Its body is implemented in another programming language. 
private It is accessible only from within its own class. 
protected It is accessible only from within its own class and its extensions. 
public It is accessible from all classes. 
static It has no implicit argument. 
strictfp Its floating-point results will be platform-independent. 
synchronized | It must be locked before it can be invoked by a thread. 
volatile It may be modified by asynchronous threads. 


Table 1.4 Method modifiers 


Modifier Meaning 


final It must be initialized and cannot be changed. 


Table 1.5 Local variable modifier 


The three access modifiers, public, protected, and private, are used to specify where the 
declared member (class, field, constructor, or method) can be used. If none of these is specified, 
then the entity has package access, which means that it can be accessed from any class in the 
same package. 


The modifier final has three different meanings, depending upon which kind of entity it 
modifies. If it modifies a class, final means that the class cannot be extended to a subclass. (See 
Chapter 9.) If it modifies a field or a local variable, it means that the variable must be initialized 
and cannot be changed, that is, it is a constant. If it modifies a method, it means that the method 
cannot be overridden in any subclass. 


The modifier static means that the member can be accessed only as an agent of the class 
itself, as opposed to being bound to a specific object instantiated from the class. For example, the 
format() method, invoked at line 34 in the Line class in Example 1.5 on page 6 is a static 
method: 


return String. format("%d/%d", num, den); 


10 OBJECT-ORIENTED PROGRAMMING [CHAP. 1 


It is bound to the String class itself, accessed as String. format(). On the other hand, the 
value() method, invoked at line 7 in the test program is a nonstatic method. It is bound to the 
object x, an instance of the Ratio class, and is accessed x. value(). 

A static method is also called a class method; a nonstatic method is also called an instance 
method. The object to which an instance method is bound in an invocation is called its implicit 
argument for that invocation. For example, the implicit argument in the invocation 
x.equals(xx) is the object x. (xx is the explicit argument.) Note that every program’s main() 
method is a static method. 


COMPOSITION, AGGREGATION, AND INHERITANCE 


There are several different ways to associate one class with another: composition, aggregation, 
and inheritance. 

When a class A contains references to instances of a class B and controls all access to those 
instances, we say that A is a composition of B. For example, a University class would be a 
composition of Department objects. Each Department object belongs to a unique University 
object, which controls access to its department. If A is a composition of B, we say that an A object 
“owns a” B object. For example, a university owns a department. 

When a class A contains references to a class B whose 
instances exist and are accessible outside of A, we say that A University 
is an aggregation of B. For example, in a university software 
system, a Department class would contain references to 
Professor objects who are members of the department, but 
who also exist outside the department. In fact, a professor 
could be a member of two different departments. If A is an 
aggregation of B, we say that an A object “has a” B object. 
For example, a department has a professor. 

When a class A includes all the members of a class B, we 
say that A is an extension of B, and that it inherits all the 
properties of B. For example, a Professor class would be y, 
an extension of a Person objects. If A is an extension of B, 
we say that an A object “is a” B object. For example, a 
professor is a person. 

Figure 1.6 illustrates how these three class associations 
are represented in UML. Composition is indicated by a Professor 
filled diamond head adjacent to the composing class, aggre- 
gation is indicated by an empty diamond head adjacent to 
the aggregating class, and inheritance is indicated by an 
empty arrow head adjacent to the extended class. 


Department 


EXAMPLE 1.6 Implementing Associations 


public class Person { 


private final String name; Person 


this.name = new String(name) ; 


1 
2 
3 
4 public Person(String name) { 
: } Figure 1.6 Class associations 
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8 public String getName() { 
9 return new String(name) ; 
10 } 

1 } 


Instances of the Person class represent people. 


1 public class Professor extends Person { 

2 public static enum Rank {INSTR, ASST, ASSOC, PROF} 
3 

4 private Rank rank; 

5 

6 public Professor(String name, Rank rank) { 
7 super (name) ; 

8 this.rank = rank; 

9 } 

10 

1 public Rank getRankQd { 

12 return rank; 

13 } 

14 

15 public void setRank(Rank rank) { 

16 this.rank = rank; 

17 } 

18 } 


The Professor class extends the Person class, inheriting its name field and its getName method. It 
defines at line 2 an enum field named Rank that specifies the four values INSTR, ASST, ASSOC, and PROF. 
Its constructor at line 6 requires the professor’s name and rank. Note how the Professor constructor uses 
the super keyword to invoke the Person constructor at line 7. 

1 public class University { 


2 private static class Department { 

3 final String name; 

4 Set<Professor> members; 

5 

6 public Department(String name) { 

7 this.name = new String(name) ; 

8 this.members = new HashSet<Professor>(); 
9 } 

10 

1 public void add(Professor professor) { 

12 members.add(professor) ; 

13 } 

14 } 

15 

16 private final String name; 

17 private Map<String, Department> departments; 
18 

19 public University(String name) { 

20 this.name = new String(name) ; 

21 this.departments = new TreeMap<String, Department>() ; 
22 } 

23 

24 public String getName() { 

25 return new String(name) ; 


26 } 
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28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 


} 
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public void addDepartment(String name, Set<Professor> members) { 
Department dept = new Department (name) ; 
departments.put(name, dept); 
for (Professor prof : members) { 
dept.add(prof) ; 
} 
} 


public void add(Professor prof, String deptName) { 
Department dept = departments. get (deptName) ; 
if (dept == null) { 
throw new RuntimeException(deptName + " does not exist."); 
} else { 
dept.add(prof) ; 
} 
} 


public Set<String> departments() { 
return departments.keySet(); 


[CHAP. 1 


The University class is a composite of Department objects. The existence of a department is depen- 


dent upon the existence of its university. Therefore, the Department class should be completely 
controlled and insulated by the University class. This is done by defining it to be a nested private 
static class at line 2. 


Professors). It includes an add() method at line 11 for adding professors to the department. 


The University.Department class has two fields: name (a String), and members (a Set of 


The University class has two fields: name (a String), and departments (a Map of Department 


objects, indexed by their names). It includes two addQ methods (at lines 28 and 36) and an accessor 
method that returns the Set of department names (at line 45). 


Note that the University.Department class is an aggregate of Professor objects. The existence of 


a professor is independent of his or her department’s existence. Therefore, the Professor class is defined 
separately from the University.Department class. 
public class TestUniversity { 


1 


2 
3 
4 
5 
6 
7 
8 
9 


} 


public static void main(String[] args) { 
University jsu = new University("JavaStateUniversity") ; 
Professor adams new Professor("Adams", Professor.Rank.ASSOC) ; 
Professor baker new Professor("Baker", Professor.Rank.ASST) ; 
Professor cohen = new Professor("Cohen", Professor.Rank.PROF) ; 
Set<Professor> profs = new HashSet<Professor>(); 
Collections.addAll(profs, adams, baker, cohen); 
jsu.addDepartment("Computer Science", profs); 
Professor davis = new Professor("Davis", Professor.Rank.ASST) ; 
Professor evans = new Professor("Evans", Professor.Rank.INSTR) ; 
profs.clear(); 
Collections.addAll(profs, davis, evans, baker); 
jsu.addDepartment("Biology", profs); 
adams.setRank(Professor. Rank. PROF) ; 

} 


that Prof. Baker is a member of both departments. 


This test program creates the university with two departments, each containing three professors. Note 
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The departments’ aggregation of professors is evidenced by two features of this program: A profes- 
sor may belong to more than one department, and a professor’s attributes may be changed independently 
of his or her department: Prof. Adams is promoted to Professor. Rank. PROF at line 15. 


Example 1.6 uses several classes that are defined in the Java Collections Framework (JCF). 
This library is part of the java.util package. The JCF is outlined in Chapter 4. 

Example 1.6 also uses two new Java features, introduced with Java 5.0: enum types, and the 
for-each construct. 


THE UNIFIED MODELING LANGUAGE 
The Unified Modeling Language (UML) is illustrated in Figure 1.6 on page 10 and shows the 


symbols representing three kinds of association between classes. These are summarized in Table 
1.6. 


Association Symbol 
Composition o___—_- 
Aggregation > 

Inheritance > 


Implementation} ~~ = = = _ > 


Table 1.6 UML symbols 


The implementation symbol is used to show that a class implements an interface. 
EXAMPLE 1.7 Implementing the Comparab1e Interface 


Change line 1 of Example 1.5 on page 6 to: 
1 public class Ratio implements Comparable { 
and then add these two methods: 


2 public int compareTo(Object object) { 

3 if (object==this) { 

4 return 0; 

5 } else if (!Cobject instanceof Ratio)) { 

6 throw new Il1llegalArgumentException("Ratio type expected"); 
7 } 

8 Ratio that = (Ratio)object; 

9 normalize(this); 

10 normalize(that) ; 

1 return (this.num*that.den - that.num*this.den); 

12 } 

13 

14 private static void normalize(Ratio x) { 

15 if (x.num == 0) { // x == Ratio.ZERO 

16 x.den = 1; 

17 } else if (x.den < 0) { // change sign of num and den: 
18 X.num *= -1; 

19 x.den *= -1; 

20 } 
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The Comparab1e interface requires the compareTo() method, as specified at line 2. The purpose of the 
method is to indicate whether the implicit argument is less than, equal to, or greater than the explicit 
argument. The indicator is the sign of the returned integer: Negative indicates that this < object, zero 
indicates that this = object, and positive indicates that this > object. 

In the case of ratios a/b and c/d, we can tell whether a/b < c/d by cross-multiplying and checking 
whether ad < bc. Since that case should be indicated by returning a negative number, we can simply return 
ad — bc. Indeed, the value of that expression will be negative, zero, or positive, when a/b < c/d, a/b = c/d, 
or a/b > c/d, respectively. However, that arithmetic trick works only if b > 0 and d> 0. 

To ensure that the denominators are not negative, we 


employ the static utility method defined at line 14. It 
«interface» 
Comparable 


ensures that the denominator of its argument is not 
+ compareTo(Object): int 


negative. 


Figure 1.7 illustrates how the implementation of 
an interface is represented in UML. The association 
symbol is a dashed arrow pointing to the interface. a 
The diagram for the interface marks its name with 
the stereotype «interface». 

Note that an interface rectangle has only two 
parts, while a class rectangle has three. This is 
because interfaces have no fields. 

Figure 1.7 also illustrates how to represent 
package membership. The Comparab]e interface is Figure 1.7 Interface implementation in UML 
part of the java. lang package, as indicated by the 
tab above the interface symbol. 


POLYMORPHISM 


Java is a strongly typed language, which means that every variable must be declared to have a 
type which determines how that variable can be used. A char cannot be used where a boolean is 
expected, and a Ratio object cannot be used where a Person object is expected. But there are 
some situations where a variable of one type can be used where another type is expected. This is 
called polymorphism (literally, “many forms”) because the object appears to have more than one 
type. 

There are several kinds of polymorphism. The simplest kind is called inclusion polymorphism. 
Illustrated in Example 1.8, it refers to the ability of an object to invoke a method that it inherits. 


EXAMPLE 1.8 Inclusion Polymorphism 


public class TestRatio { 
public static void main(String[] args) { 
Ratio x = new Ratio(22, 7); 
System.out.printIn("x.hashCode(): " + x.hashCode()); 
} 
} 
The output is: 
x. hashCodeQ): 1671711 
At line 4, the Ratio object x invokes the hashCode() method that it inherits from the Object class. 


Oa ak WHY =| 
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Another kind of polymorphism occurs with generic methods, where an actual type is substi- 
tuted in for a type parameter. This is illustrated in Example 1.9. 


EXAMPLE 1.9 Parametric Polymorphism 


1 public class TestSort { 
2 public static void main(String[] args) { 

3 String[] countries = {"CN", "IN", "US", "ID", "BR"}; 
4 print(countries); 

5 Arrays.sort(countries) ; 

6 print(countries); 

7 Ratio[] ratios = new Ratio[3]; 

8 ratios[0] = new Ratio(22, 7); 

9 ratios[1] = new Ratio(25, 8); 


10 ratios[2] = new Ratio(28, 9); 
1 print(ratios); 

12 Arrays.sort(ratios) ; 

13 print(ratios); 

14 } 

15 

16 static <T> void print(T[] a) { // generic method 
17 for (Tt:a)f{ 

18 System.out.printf("%s ", t); 
19 } 

20 System.out.printiInQd; 

21 } 

22 } 


Here is the output: 
CN IN US ID BR 
BR CN ID IN US 
22/7 25/8 28/9 
28/9 25/8 22/7 
The print () method defined at line 16 is a generic method. It uses the type parameter T as a place- 
holder for an actual type, which will be determined at run time. When the generic method is invoked at 
lines 4 and 6, the type String is used in place of T. When it is invoked at lines 11 and 13, the type Ratio 
is used. This makes the method polymorphic, capable of printing arrays of String objects and arrays of 
Ratio objects. 
The program also uses the generic sort() method that is defined in the java.util .Arrays class. It 
requires the type parameter T to be an extension of the Comparable interface, which both the String 
class and our Ratio class are. 


Inclusion polymorphism and parametric polymorphism are both special cases of universal 
polymorphism. The other general kind of polymorphism is called ad hoc polymorphism which 
also has two special kinds, named overloading polymorphism and coercion. These are best illus- 
trated with primitive types. 

The plus operator (+) is polymorphically overloaded: It means integer addition when used in 
the form 22 + 33; it means floating point addition when used in the form 2.2 + 3.3; and it 
means string concatenation when used in the form name + ", Esq. 

Coercion occurs when a value of one type is implicitly converted to a value of another type 
when the context requires that other type. For example, when the compiler evaluates the expres- 
sion 22 + 3.3, it interprets the plus operator as floating point addition, requiring both operands 
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to be either type float or type doub1e. So it “coerces” the 22 into being 22 .0 and then performs 
the operation. 


JAVADOC 


A Java program is a main class together with the other classes that it uses. Most classes are 
part of the standard Java Class Libraries provided by Sun Microsystems, Inc. These libraries are 
orgainzed into several package hierarchies, known collectively as the Application Programming 
Interface (API). 

The two most common package hierarchies are the java and javax hierarchies, parts of 
which are shown in Figure 1.8. 

java 
applet 


sa 
color 


event 
io 
lang 
ie reflect 


math 
text 


til 
: [ea regex 
i ae rowset 


swing 


javax 


border 
event 

table 

text 

tree 

text 

util 

regex 


Figure 1.8 Parts of the java and javax package hierarchies 


Complete documentation is provided for all the classes in the Java API. Also called Java API 
Specification, these javadocs are located at: 
http://java.sun.com/javase/6/docs/api/ 
The Javadoc for the String class, in the java.lang package, is shown in Figure 1.9 on page 17. 


Review Questions 


1.1. What is a requirements document? 


1.2. What is the difference between the design stage and the implementation stage in the develop- 
ment of software? 
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String (Java Platform SE 6) - Mozilla Firefox 
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javalang 
Class String 


java. lang. Object 
L4ava.lang. String 


All Implemented Interfaces: 
Serializable, CharSequence, Comparable<Stnng> 


public final class String 
extends Object 
implements Serializable, Comparable<String>, CharSequence 


The String class represents character strings. All string literals in Java programs, such as "abc", are 
implemented as instances of this class. 


Strings are constant, their values cannot be changed after they are created. String buffers support 
mutable strings. Because String objects are immutable they can be shared. For example: 


String str = "abc"; 


is equivalent to: 


char data[] = {'a', ‘b', ‘c'}: 
String str = nev String(data); 


Here are some more examples of how strings can be used: 


System.out.println("abc"): 
String cde = "cde"; 
System.out.println("abc” + cde); 
String c "abc". substring(2,3)- 
String d cde.substring(1, 2); 


Figure 1.9 Javadoc for the java.lang.String class 


What is the difference between the state and the behavior of a class? 


What is an abstract data type? 


What constitutes a Java program? 


What kinds of members can a class have? 


What is an implicit argument? 


What is the purpose of the toString() method? 


What is the purpose of the equals() method? 


1.10 What's the difference among public, protected, and private? 
1.11. What is a package? 


1.12 What is the difference between an abstract class and an abstract data type? 
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1.13 
1.14 
1.15 


1.16 


1.1 


1.2 


1.3 
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What is the difference between a constructor and a method? 
What is the difference between a class method and an instance method? 


What is the difference between equality of objects and equality of the references that refer to 
them? 


Explain the difference between the output from 
String s; 


System.out.printIn("s = " + s); 
and the output from 

String s = new StringQ); 

System.out.printIn("s = " + s); 


What is the purpose of declaring a field private and declaring a mutator method that allows 
the public to change it. Wouldn’t it be just as good to just make it public? 


What is an enum type? 
What is the difference between composition and aggregation? 


What is polymorphism? 


Problems 


Translate this ADT into a Java interface: 
ADT: Point 
amplitude(): Real 
distanceTo(Point): Real 
equals(Point): Boolean 
magnitude(): Real 
toString(Q): String 
xCoordinate(): Real 
yCoordinate(): Real 


Translate this ADT into a Java interface: 
ADT: Line 
contains(Point): Boolean 
equals(Line): Boolean 
isHorizontal(): Boolean 
isVerticalQ): Boolean 
slope(): Real 
toString(Q): String 
xInterceptQ): Real 
yIntercept(Q): Real 


Translate this ADT into a Java interface: 
ADT: Circle 
area(): Real 
center(): Point 
circumference(): Real 
contains(Point): Boolean 
equals(Circle): Boolean 
radius(): Real 
toString(): String 
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1.4 


1.5 
1.6 
1.7 
1.8 


1.1 


1.2 


1.3 


1.13 


Translate this ADT into a Java interface: 
ADT: Polynomial 
degree(Q): int 
derivative(): Polynomial 
equals(Point): Boolean 
sum(Polynomial): Polynomial 
toStringQ): String 
valueAt(Real): Real 


Implement the ADT in Problem 1.1 with a Java class, and draw its UML diagram. 
Implement the ADT in Problem 1.2 with a Java class, and draw its UML diagram. 
Implement the ADT in Problem 1.3 with a Java class, and draw its UML diagram. 
Implement the ADT in Problem 1.4 with a Java class, and draw its UML diagram. 


Answers to Review Questions 


The requirements document of a software development project is a precise specification of what the 
software should do. 


In the development of software, the design stage identifies the components (classes) to be used and the 
relationships among them. The implementation stage is where the computer progam code is actually 
written. 


The state of a class consists of the values of its fields. The behavior of a class is defined by its meth- 
ods. 


An abstract data type is a specification of the type’s operations: what an instance of that type can do. 


A Java program is a Java class with a main() method? The main() method must have this header: 
public static void main(String[] args) 


A class member may be a field, a constructor, a method, a nested class, a nested interface, or an enum 
type. 

The implicit argument of a method is the object to which the method invocation is bound. 

The toString() method returns a String object that represents the state of its implicit argument. 


The equals() method returns true if and only if states (contents) of its implicit and explicit arguments 
are the same. 


A class member that is declared public is accessible from any other class. A class member that is 
declared protected is accessible only from its own class and subclasses (extensions). A class mem- 
ber that is declared private is accessible only from its own class. 


A package is a namespace, that is, a name for a group of classes, interfaces, and enum types that can be 
used to distinguish those from other classes, interfaces, and enum types with the same name. 


An abstract class is a Java class with at least one abstract method—a method that has no body. An 
abstract data type is a specification of a type’s operations that could be implemented in any object-ori- 
ented programming language. 


A constructor is a member function of a class that is used to create objects of that class. It has the same 
name as the class itself, has no return type, and is invoked using the new operator. A method is an ordi- 
nary member function of a class. It has its own name, a return type (which may be void), and is 
invoked using the dot operator. 
A class method is declared static and is invoked using the class name. For example, 

double y = Math.abs(x); 
invokes the class method abs (© that is defined in the Math class. An instance method is declared with- 
out the static modifier and is invoked using the name of the object to which it is bound. For example, 
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double x = random.nextDouble(); 
invokes the class method nextDoub1e() that is defined in the Random class and is bound to the object 
random which is an instance of that class. 


Two objects should be equal if they have the same data values (1.e., the same state). Two references are 
equal if they refer to the same object. The condition (p == q) tests equality of the references p and q, 
not the equality of the objects to which they refer. 


The output from the code 
String s; 
System.out.printIn("s = " + s); 

is 
s = null 

The output from the code 
String s = new String(); 
System.out.printIn("s = 


+ S)3 
is 
SsS= 
In the first case, the reference s is initialized by default to be nu11; there is no String object. In the 
second case, s is initialized to refer to the empty String object. 


The advantage of forcing the public to use a mutator method to change a field is that you can control 
how the field is changed. 


An enum type is a type, defined with the enum keyword that lists by name each possible value for the 
type. 


When a type is composed of another type, the complete existence of that component type’s objects is 
controlled by the composing type’s object; the components are uniquely bound to it and cannout be 
changed by any outside object. With aggregation, the component elements exits outside of the collec- 
tion, can be changed by other classes, and may even be components of other aggregates. 


Polymorphism describes the way an object or variable may be treated in different contexts as though it 
has a different type. For example, inheritance allows an argument of type B to be passed to a parameter 
of type A if B extends A. 


Solutions to Problems 


public interface Point { 
public double amplitude() ; 
public double distanceTo(Point point); 
public boolean equals(Object object); 
public double magnitude() ; 
public String toString(); 
public double xCoordinate() ; 
public double yCoordinate(); 
} 


public interface Line { 
public boolean contains(Point point); 
public boolean equals(Object object) ; 
public boolean isHorizontal(); 
public boolean isVertical(); 
public double slope(); 
public String toString(); 
public double xIntercept(); 
public double yIntercept(); 
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1.3 public interface Circle { 
public double area(); 
public Point center(); 
public double circumference() ; 
public boolean contains(Point point); 
public boolean equals(Object object); 
public double radius(); 
public String toString(); 
} 


1.4 public interface Polynomial { 
public int degree); 
public Polynomial derivative(); 
public boolean equals(Object object); 
public Polynomial sum(Polynomial polynomial) ; 
public String toString(); 
public double valueAt(double x); 
} 


1.5 public class MyPoint implements Point { 
private double x, y; 
public static Point ORIGIN = new MyPoint(); 


private MyPoint() { 
} 


public MyPoint(double x, double y) { 
this.x = x; 
this.y = y; 

} 


ll 


public double amplitude() { 
return Math.atan(y/x) ; 
} 


public double distanceTo(Point point) { 
if (point.equals(this)) { 
return 0.0; 
} else if (! (point instanceof MyPoint)) { 
throw new I]legalArgumentException("use a MyPoint object"); 
} else { 
MyPoint that = (MyPoint) point; 
double dx = that.x - this.x; 
double dy = that.y - this.y; 
return Math.sqrt(dx*dx + dy*dy); 
} 
} 


public boolean equals(Object object) { 
if (Cobject==this) { 
return true; 
} else if (!Cobject instanceof MyPoint)) { 
return false; 
} 
MyPoint that = (MyPoint)object; 
return (that.x == this.x && that.y == this.y); 
} 
public double magnitude() { 
return Math.sqrt(x*x + y*y); 
} 
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public String toString() { 
return String. format("(%.27,%.2f)", x, y); 
} 


public double xCoordinate() { 
return x; 


} 


public double yCoordinate() { 
return y; 
} 
} 


public class MyLine implements Line { 
private double m, b; // slope, intercept 
public static Line X_AXIS = new MyLine(); 


private MyLine() { 
} 


public MyLine (double m, double b) { 
this.m = m; 
this.b = b; 

} 


public boolean contains(Point point) { 
double x = point.xCoordinate() ; 
double y = point.yCoordinate() ; 
return y == m*x + b; 


} 


public boolean equals(Object object) { 
if Cobject==this) { 
return true; 
} else if (!Cobject instanceof MyLine)) { 
return false; 
} 
MyLine that = (MyLine)object; 
return (that.m == this.m && that.b == this.b); 
} 


public boolean isHorizontald) { 
return m == 0.0; 


} 


public boolean isVertical(d) { 
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return m == Double.POSITIVE_INFINITY || m==Double.NEGATIVE_INFINITY; 


} 


public double sloped) { 
return m; 


} 


public String toString() { 
return String.format("y = %.2fx + %.2f", m, b); 
} 
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public double xInterceptQ) { 
if CisHorizontal(d)) { 
throw new RuntimeExceptionC("this line is horizontal"); 
} 
return -b/m; 


} 


public double yInterceptQ) { 
if CisVerticald) { 
throw new RuntimeExceptionC("this line is vertical"); 
} 
return b; 
} 
} 


public class MyCircle implements Circle { 
private Point c; // center 
private double r; // radius 


public MyCircle() { 
} 


public MyCircleC(Point c, double r) { 
this.c = c; 
this.r =r; 


} 


public double area() { 
return Math.PI*r*r; 


} 


public Point center() { 
return Cc; 


} 


public double circumference() { 
return 2*Math.PI*r; 
} 


public boolean contains(Point point) { 
double x = point.xCoordinate() ; 
double y = point.yCoordinate() ; 
return x*x + y*y < r*r; 


z 


public boolean equals(Object object) { 
if (Cobject==this) { 
return true; 
} else if (!Cobject instanceof MyCircle)) { 
return false; 


} 

MyCircle that = (MyCircle)object; 

return (that.c == this.c && that.r == this.r); 
} 


public double radius() { 
return r; 


; 
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public String toString() { 
return String.format("[Center: %.2fx; Radius: %.2f]", c, 1); 
} 
} 


public class MyPolynomial implements Polynomial { 
private double[] c; // coefficients 


public MyPolynomial(double[] a) { // ali] = coeffficient of xAi 
int n = c.length; 
c = new double[n]; 
System.arraycopy(a, 0, c, 0, n); 
} 
public int degree() { 
return c.length - 1; 


} 


public Polynomial derivative() { 
double da[] = new double[c. length-1]; 
for Cint i=0; i<da.length; i++) { 
dali] = (i+1)*c[i+1]; 
} 
return new MyPolynomial (da) ; 


} 


public boolean equals(Object object) { 

if (Cobject==this) { 
return true; 

} else if (! (object instanceof MyPolynomial)) { 
return false; 

} 

MyPolynomial that = (MyPolynomial)object; 

return java.util.Arrays.equals(that.c, this.c); 


} 


public Polynomial sum(Polynomial p) { 
if C!(p instanceof MyPolynomial)) { 
throw new I]legalArgumentException("use a MyPolynomial object"); 
} 
MyPolynomial that = (MyPolynomial)p; 
double[] pc = that.c; 
int n = Math.max(c. length, pc.length); 
MyPolynomial q = new MyPolynomial (new double[n]); 
for Cint i=0; i<n; i++) { 
q.cli] = cli] + pci]; 
} 
return q; 


} 


public String toString() { 

StringBuilder buf = new StringBuilder(); 

int n = c.length; 

if (n> 0 &&c[0] != 0.0) f{ 
buf.append(c[0]); 

} 

if (n> 1 &&c[1] != 0.0) { 
buf.append(String.format(" + %.2f", c[1])); 

} 
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for Cint i=2; i<n; i++) { 
if (c[i] != 0.0) { 
buf.append(String.format(" + %.2fA%d", c[i], i)); 
} 


} 
return buf.toStringQ ; 


} 


public double valueAt(double x) { 
double y = 0.0; 
for Cint i=0; i<c. length; i++) { 
y += c[i]*Math.pow(x, 1); 


return y; 


} 


2D 


Arrays 


An array is an object that consists of a sequence of elements that are numbered 0, 1, 2,... The 
element numbers are called index numbers. Array elements can be accessed by their index 
numbers using the subscript operator [], as a[0], a[1], a[2], and so on. 

Arrays are widely used because they are so efficient. 


PROPERTIES OF ARRAYS 


Here are the main properties of arrays in Java: 


20. 


Arrays are objects. 

Arrays are created dynamically (at run time). 

Arrays may be assigned to variables of type Object. 

Any method of the Object class may be invoked on an array. 

An array object contains a sequence of variables. 

The variables are called the components or elements of the array. 

If the component type is T, then the array itself has type T[]. 

An array type variable holds a reference to the array object. 

The component type may itself be an array type. 

An array element is a component whose type is not an array type. 

An element’s type may be either primitive or reference. 

The /ength of an array is its number of components. 

An array’s length is set when the array is created, and it cannot be changed. 
An array’s length can be accessed as a public final instance variable. 
Array index values must be integers in the range 0...length —1. 

An ArrayIndexOutOfBoundsException is thrown if Property 15 is violated. 
Variables of type short, byte, or char can be used as indexes. 

Arrays can be duplicated with the Object.clone() method. 

Arrays can be tested for equality with the Arrays.equals() method. 

Array objects implement Cloneable and java.io.Serializable. 


Property 3 follows from Property 1. Although array types are not classes, they behave this way 
as extensions of the Object class. Property 7 shows that array types are not the same as class 
types. They are, in fact, derived types: For every class type T there is a corresponding array type 
T[]. Also, for each of the eight primitive types, the corresponding array type exists. 
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Property 9 allows the existence of arrays of arrays. Technically, Java allows multidimensional 
arrays only with primitive types. But for objects, an array of arrays is essentially the same thing. 
Since arrays themselves are objects, an array of arrays is an array of objects, and some of those 
component objects could also be nonarrays. (See Example 2.1.) 

Note that a consequence of Property 13 is that changing a reference component value to nu11 
has no effect upon the length of the array; nu11 is still a valid value of a reference component. 


EXAMPLE 2.1 Some Array Definitions 


Here are some valid array definitions: 
public class ArrayDefs { 


1 

2 public static void main(String[] args) { 

3 float x[]; 

4 x = new float[100]; 

5 args = new String[10]; 

6 boolean[] isPrime = new boolean[1000] ; 

7 int fib[] = {0, 1, 1, 2, 3, 5, 8, 13}; 

8 short[][][] b = new short[4] [10] [5]; 

9 double a[][] = {{1.1,2.2}, {3.3,4.4}, null, {5.5,6.6}, null}; 
10 a[4] = new double[66]; 

11 a[4] [65] = 3.14; 

12 Object[] objects = {x, args, isPrime, fib, b, a}; 
13 } 

14 } 


Line 3 declares x[] to be an array of floats but does not allocate any storage for the array. Line 4 
defines x[] to have 100 Float components. 

Line 5 declares args[] to be an array of 10 String objects. Note the two different (equivalent) ways 
to declare an array: The brackets may be a suffix on the type identifier or on the array identifier. Line 5 
defines args[] to have 10 String components. 

Line 6 defines isPrime[] to be an array of 1000 boolean variables 

Line 7 defines fib[] to be an array of 8 ints, initializing them to the 8 values listed. So for example, 
fib[4] has the value 3, and fib[7] has the value 13. 

Line 8 defines b[][][] to be a three-dimensional array of 4 components, each of which is a two- 
dimensional array of 10 components, each of which is a one-dimensional array of 5 component elements 
of type short. 

Line 9 defines a[][] to be an array of five components, each of which is an array of elements of type 
double. Only three of the five component arrays are allocated. Then line 10 allocates a 66-element array 
of doubles to a[4], and line 11 assigns 3.14 to its last element. 

Line 12 defines the array objects to consist of six components, each of which is itself an array. The 
components of the first four component arrays are elements (nonarrays). But the components of the 
components b and a are not elements because they are also arrays. The actual elements of the objects 
array include 2, 5, and 13 (components of the component fib), nul] (components of the component a), 
and 2.2 and 3.14 (components of the components of the component a). 


The array a[][] defined in Example 2.1 is called a ragged array because it is a two-dimen- 
sional array with rows of different lengths. 

The element type of an array in Java can be a primitive type, a reference type, or an array type. 
The simplest, of course, are arrays of primitive type elements, such as x[], isPrime[], and fib[] 
in Example 2.1. These are arrays that can be sorted. 
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DUPLICATING AN ARRAY 


Since it is an object, an array can be duplicated by invoking the Object.clone() method, as 
shown in Example 2.2. 


EXAMPLE 2.2 Duplicating an Array 


15 public class DuplicatingArrays { 

16 public static void main(String[] args) { 
17 int[] a = {22, 44, 66, 88}; 

18 print(a); 

19 int{] b = Cint[])a.cloneQ; // duplicate a[] in b[] 
20 print(b) ; 

21 String[] c = {"AB", "CD", "EF"}; 

22 print(c); 

23 String[] d = (String[])c.cloneQ; // duplicate c[] in d[] 
24 print(d); 

25 c[1] = "XYZ"; // change c[], but not d[] 
26 print(c); 

27 print(d) ; 

28 } 

29 

30 public static void printCint[] a) { 

31 System.out.printfC"{%d", a[0]); 

32 for Cint i = 1; 1 < a.length; i++) { 
33 System.out.printfC", %d", a[i]); 

34 } 

35 System.out.printIn("}"); 

36 } 

37 

38 public static void print(Object[] a) f{ 
39 System.out.printfC"{%s", a[0]); 

40 for Cint i = 1; 1 < a.length; i++) { 
a System.out.printfC", %s", a[i]); 

42 } 

43 System.out.printIn("}"); 

44 } 

45 } 


The output is: 
{22, 44, 66, 88} 
{22, 44, 66, 88} 


{AB, CD, EF} 
{AB, CD, EF} 
{AB, XYZ, EF} 
{AB, CD, EF} 


The array a[] contains four int elements. The array b[] is a duplicate of a[]. Similarly, the 
array d[] is a duplicate of the array c[], each containing three String elements. In both cases, 
the duplication is obtained by invoking the clone() method. Since it returns a reference to an 
Object, it must be cast to the array type being duplicated, int[] or String[]. 

The last part of the example shows that the cloned array d[] is indeed a separate copy of c[1]: 
Changing c[1] to "XYZ" has no effect upon the value "CD" of d[1]. 
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THE java.util.Arrays CLASS 

Java includes a special “utility” class for processing arrays. The name of this class is Arrays, 
and it is defined in the java.util package. 
EXAMPLE 2.3 Using the java.util .Arrays Class 


This program imports the Arrays class from the java.util package to access the sort(), 
binarySearch(), £1110, and equals() methods. It also imports the static print() method from 


Example 2.2. 

1 import java.util.Arrays; 

2 

3 public class TestArrays { 

4 public static void main(String[] args) { 

5 int[] a = {44, 77, 55, 22, 99, 88, 33, 66}; 

6 print(a); 

7 Arrays.sort(a) ; 

8 print(a); 

9 int k = Arrays.binarySearch(a, 44); 

10 System.out.printfC"Arrays.binarySearch(a, 44): %d%n", k); 
11 System.out.printfC"a[%d]: %d%n", k, a[k]); 

12 k = Arrays.binarySearch(a, 45); 

13 System.out.printfC"Arrays.binarySearch(a, 45): %d%n", k); 
14 int[] b = new int[8]; 

15 print (b); 

16 Arrays.fill(b, 55); 

17 print (b); 

18 System.out.printInC"Arrays.equals(a,b): " + Arrays.equals(a,b)); 
19 } 

20 } 

The output is 


44 77 55 22 99 88 33 66 

22 33 44 55 66 77 88 99 
Arrays.binarySearch(a, 44): 2 
a[2]: 44 
Arrays.binarySearch(a, 45): -4 
00000000 

55 55 55 55 55 55 55 55 
Arrays.equals(a,b): false 

The array a[] is created and printed at lines 5—6. At line 7, the call Arrays.sort(a) sorts the elements 
of the array, putting them in ascending order, as we can see from the output from line 8. 

At line 9, the Arrays.binarySearch() method is invoked. The second argument, 44, is the search 
target. The method returns the index 2, which is assigned to k at line 9. Line 11 verifies that 44 is indeed 
the value of a[2]. 

The method is invoked again at line 13, this time searching for the target 45. The value is not found in 
the array, so the method returns a negative number, k = —4. When this happens, the index i =— — 1 will be 
the position in the array where the target element should be inserted to maintain the ascending order of the 
array. Note that, in this case, i = —-k — 1 = 3, and 45 should be inserted at a[3] since there are three 
elements in the array that are less than 45. 

The output from line 17 shows how the Arrays.fi11Q method works: It filled the eight-element 
array b[] with the argument 55. 
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Finally, line 18 shows how the Arrays.equals() method works. It will return true only if the two 
arrays have the same element type (as a[] and b[] do: int[]), the same length (as a[] and b[] do: 8), 
and the same values at each element (a[] and b[] do not). 


The java.util.Arrays class is outlined in more detail in page 95. 
THE SEQUENTIAL SEARCH ALGORITHM 


The sequential search (also called the linear search) is the simplest search algorithm. It is also 
the least efficient. It simply examines each element sequentially, starting with the first element, 
until it finds the key element or it reaches the end of the array. 

If you were looking for someone on a moving passenger train, you would use a sequential 
search. 

Here is the sequential search algorithm: 

(Postcondition: either the index i is returned where s; = x, or —1 is returned.) 

1. Repeat steps 2-3, fori=O0ton-—1. 
2. (Invariant: none of the elements in the subsequence {s...s,,} is equal to x.) 
3. Ifs;= x, return i. 
4. Return —-1. 
It is implemented in Example 2.4. 


EXAMPLE 2.4 The Sequential Search 


1 public class TestBinarySearch { 

2 public static void main(String[] args) { 

3 int[] a = {22, 33, 44, 55, 66, 77, 88, 99}; 

4 ch02.ex02.DuplicatingArrays.print(a) ; 

5 System.out.printInC"search(a, 44): " + search(a, 44)); 
6 System.out.printInC"search(a, 50): " + search(a, 50)); 
7 System.out.printInC"search(a, 77): " + search(a, 77)); 
8 System.out.printInC"search(a, 100): " + search(a, 100)); 
9 } 

10 

"1 public static int searchCint[] a, int x) { 

12 // POSTCONDITIONS: returns an integer ji; 

13 // if i >= 0, then a[i] == x; otherwise x is not in a[]; 
14 for Cint i=0; i<a.length; i++) { // step 1 
15 // INVARIANT: x 1S not among a[O]...a[i-1] // step 2 
16 if cali] == x) { // step 3 
17 return 1; 

18 } 

19 } 

20 return -1; // step 4 
21 } 

22 } 


The output is: 
{22, 33, 44, 55, 66, 77, 88, 99} 
search(a, 44): 2 
search(a, 50): -1 
search(a, 77): 5 
search(a, 100): -1 
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The search() method returns the index of the target x: search(a, 44) returns 2, because a[2] = 44; 
search(a, 77) returns 5, because a[5] = 77. The method returns —1 when the target is not in the array: 
search(a, 50) returns —1, because 50 is not in the array. 


The sequential search is correct. This means that it works. The following argument is a 
proof of that fact. 

If n = 0, then the sequence is empty and the loop does not execute at all. Only step 4 executes, 
immediately returning —1. This satisfies the postconditions: x cannot equal any of the elements 
because there aren’t any. 

If nm = 1, then the loop iterates only once, with i = 0. On that iteration, either sy = x or sy # x. If 
Sq =x, then 0 is returned and the postcondition is satisfied. If sy) # x, then the loop terminates, step 
4 executes, and —1 is returned, and that satisfies the postcondition because the single element of 
the sequence is not equal to x. 

Suppose 7 > 1. We want to apply the First Principle of Mathematical Induction to deduce that 
the loop invariant must be true on every iteration of the loop. (See page 321.) That requires the 
verification of the invariant on the first iteration and the deduction of the invariant on iteration i 
from the corresponding invariant on iteration i—-1. 

On the first iteration of the loop, i = 0, and the loop invariant in step 2 is true “vacuously” 
because the subsequence {s>...5;_;} 1s empty. Then in step 3, either sy = x or sy # x. If sy =x, then 
0 is returned and the postcondition is satisfied. If sy # x, then the loop continues on to a second 
iteration. Then i = 1, and the loop invariant in step 2 is again true because the subsequence 
{5o...S;_1} = {So} and sy # x. 

Suppose now that on iteration i-1, the loop invariant is true; that is, none of the elements in 
the subsequence {5sy..5,_,} is equal to x. If the loop continues on to the next iteration, then the 
condition s; = x at step 3 was not true. Thus, s; 4 x. Therefore, none of the elements in the subse- 
quence {5y..s;} 1s equal to x, which is the loop invariant on the ith iteration 

The sequential search runs in O(n) time. This means that, on average, the running time is 
proportional to the number of elements in the array. So if everything else is the same, then apply- 
ing the sequential search to an array twice as long will take about twice as long, on average. The 
following argument is a proof of that fact. 

If x is in the sequence, say at x = s; with i<n, then the loop will iterate i times. In that case, the 
running time is proportional to 7, which is O(7) since i < n. If x is not in the sequence, then the 
loop will iterate 1 times, making the running time proportional to , which is O(n). 


THE BINARY SEARCH ALGORITHM 


The binary search is the standard algorithm for searching through a sorted sequence. It is 
much more efficient than the sequential search, but it does require that the elements be in order. It 
repeatedly divides the sequence in two, each time restricting the search to the half that would 
contain the element. 

You might use the binary search to look up a word in a dictionary. 

Here is the binary algorithm: 

(Precondition: s = {so, 5, ..., 5, } 1S a sorted sequence of 7 values of the same type as x.) 

(Postcondition: either the index i is returned where s; = x, or —1 is returned.) 

1. Let ss be a subsequence of the sequence s, initially set equal to s. 
2. Ifthe subsequence ss is empty, return —1. 
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(Invariant: If x is in the sequence s, then it must be in the subsequence ss.) 
Let s; be the middle element of ss. 

If s;= x, return its index i. 

If s,<x, repeat steps 2—7 on the subsequence that lies above s,. 

. Repeat steps 2—7 on the subsequence of ss that lies below s;,. 

It is implemented in Example 2.5. 


NAAKRYW 


EXAMPLE 2.5 The Binary Search 


public class TestBinarySearch { 


1 

2 public static void main(String[] args) { 

3 int[] a = {22, 33, 44, 55, 66, 77, 88, 99}; 

4 ch02.ex02.DuplicatingArrays.print(a) ; 

5 System.out.printInC"search(a, 44): " + search(a, 44)); 

6 System.out.printInC"search(a, 50): " + search(a, 50)); 

7 System.out.printInC"search(a, 77): " + search(a, 77)); 

8 System.out.printInC"search(a, 100): " + search(a, 100)); 
9 } 

10 

"1 public static int searchCint[] a, int x) f{ 

12 // POSTCONDITIONS: returns 7; 

13 // if i >= 0, then ali] == x; otherwise i == -1; 
14 int lo = 0; 

15 int hi = a.length; 

16 while (lo < hi) { // step 1 
17 // INVARIANT: if a[j]==x then lo <= j < hi; // step 3 
18 int i = Clo + hi)/2; // step 4 
19 if cali] == x) { 

20 return 1; // step 5 
21 } else if (a[li] < x) f{ 

22 lo = i41; // step 6 
23 t+ else { 

24 hi = 73 // step 7 
25 } 

26 } 

27 return -1; // step 2 
28 } 

29 } 


The output is the same as in Example 2.4. 


The binary search is correct. The loop invariant is true on the first iteration because the 
current subsequence is the same as the original sequence. On every other iteration, the current 
subsequence was defined in the preceding iteration to be the half of the previous subsequence 
that remained after omitting the half that did not contain x. So if x was in the original sequence, 
then it must be in the current subsequence. Thus the loop invariant is true on every iteration. 

On each iteration, either i is returned where s; = x, or the subsequence is reduced by more than 
50 percent. Since the original sequence has only a finite number of elements, the loop cannot 
continue indefinitely. Consequently, the algorithm terminates either by returning 7 from within 
the loop or at step 6 or step 7 where —1 is returned. If 7 is returned from within the loop, then 
s; =x. Otherwise, the loop terminates when hi < 10; that is, when the subsequence is empty. In 
that case we know by the loop invariant that s; is not in the original sequence. 
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The binary search runs in O(ign) time. This means that, on average, the running time is 
proportional to the logarithm of the number of elements in the array. So if everything else is the 
same, if it takes an average of T milliseconds to run on an array of ” elements, then will take an 
average of 27 milliseconds to run on an array of n? elements. For example, if it takes 3 ms to 
search 10,000 elements, then it should take about 6 ms to search 100,000,000 elements! The 
following argument is a proof of that fact. 

Each iteration of the loop searches a subarray that is less than half as long as the subarray on 
the previous iteration. Thus the total number of iterations is no more than the number of times 
that the length 7 can be divided by 2. That number is Ign. And the total running time is roughly 
proportional to the number of iterations that the loop makes. 


Review Questions 


2.1. What is the difference between a component and an element of an array? 
2.2 | What does it mean to say that Java does not allow multidimensional arrays? 


2.3. What is an ArrayIndexOutOfBoundsException exception, and how does its use distin- 
guish Java from other languages such as C and C++? 


2.4. What types are valid for array indexes? 


2.5. What’s wrong with this definition: 
Arrays arrays = new Arrays(); 


2.6 | What is the simplest way to print an array of objects? 


2.7 Ifthe binary search is so much faster than the sequential search, why would the latter ever be 
used? 


2.8 | What happens if the sequential search is applied to an element that occurs more than once in 
the array? 


2.9 | What happens if the binary search is applied to an element that occurs more than once in the 
array? 


Problems 


2.1. Runa test program to see how the Arrays. fi11(Q method handles an array of objects. 


2.2 ‘If the sequential search took 50 ms to run on an array of 10,000 elements, how long would 
you expect it to take to run on an array of 20,000 elements on the same computer? 


2.3. If the binary search took 5 ms to run on an array of 1,000 elements, how long would you 
expect it to take to run on an array of 1,000,000 elements on the same computer? 


2.4 The interpolation search is the same as the binary search except that in step 4 the element s; 
is chosen so that the proportion of elements less than s, in the subsequence ss equals the pro- 
portion that would be expected in a uniform distribution. For example, looking up the name 
“Byrd” in a phone book of 2,600 pages, one would open first near page 200 because one 
would expect about 2/26 of all the names to precede it. The interpolation search can be 
shown to run in O(glgv) time. If it took 5 ms to run on an array of 1,000 elements, how long 
would you expect it to take to run on an array of 1,000,000 elements on the same computer? 
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Run a test driver for the binary search method in Example 2.5 on page 32 on an array of 
10,000 elements and count the number of iterations. 


Write and test this method: 
boolean isSorted(int[] a) 
// returns true iff a[O] <= a[1] <=... <= a[a.length-1] 


Write and test this method: 
int minimumCint[] a) 
// returns the minimum element of a[] 


Write and test this method: 
double mean(Cdouble[] a) 
// returns the average value of all the elements in a[] 


Write and test this method: 
int[] withoutDuplicatesCint[] a) 
// returns the specified array after removing all duplicates 


Write and test this method: 
void reverseC(int[] a) 
// reverses the elements of a[] 


Write and test this method: 
Object[] concatenate(Object[] a, Object[] b) 
// returns an array containing all of a[] followed by all of b[] 


Write and test this method: 
void shuffle(Object[] a) 
// randomly permutes the elements of a[] 


Write and test this method: 
int[] tally(String string) 
// returns an array a[] of 26 integers that count the frequencies 
// of the (case insensitive) letters in the given string 


Write and test this method: 
double innerProduct(double[] x, double[] y) 
// returns the algebraic inner product (the sum of the component- 
// wise products) of the two given arrays as (algebraic) vectors 


Write and test this method: 
double[][] outerProduct(double[] x, double[] y) 
// returns the algebraic outer product of the two given arrays 
// as Calgebraic) vectors: p[iJ[j] = ali]*b[j] 
Write and test this method: 
double[][] product(double[][] a, double[][] b) 
// returns the matrix product of the two given arrays a matrix: 
// plil(j] = Sum(ali] [k]*b(k] [j]:k) 
Write and test this method: 
double[][] transpose(double[][] a) 
// returns the transpose ta of the specified array as a matrix: 
// tafil(j] = aljlti] 
Write and test this method: 
int[][] pascalCint size) 
// returns Pascal’s triangle of the given size 
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The Sieve of Eratosthenes is an array of boolean elements whose ith element is true if and 
only if i is a prime number. Use the following algorithm to compute and print a sieve of size 
1000: 

(Precondition: p is an array of n bits.) 

(Postcondition: p[i] is true if and only if i is prime.) 

1. Initialize p[0] and p[1] to be false, and all other p[i] to be true. 

2. Repeat step 3 for each i from 3 to n, incrementing by 2. 

3. If there is a prime < the square root of i that divides i, set p[i] false. 


Repeat Problem 2.19 using a java.util .Vector object. 
Repeat Problem 2.19 using a java.util .BitSet object. 


Define and test a Primes class with these methods: 
public static void setLast(Cint last) // sets last 


public static void setLast() // sets last=1 

public static void sizeSize(int size) // sets size of bitset 
public static void sizeSize() // sets bitset size=1000 
public static boolean isPrimeCint n) // true if n is prime 
public static int nextQ) // next prime after last 
public static void printPrimes() // prints sieve 


Use the BitSet implementation of the Sieve of Eratosthenes from Problem 2.21. Use these 
definitions: 
public class Primes { 
private static final int SIZE = 1000; 
private static int size = SIZE; 
private static BitSet sieve = new BitSet(size); 
private static int last = 1; 
including this static initializer, which implements the Sieve of Eratosthenes: 
static { 
for Cint i = 2; 1 < SIZE; i++) { 
sieve.set(i); 
} 
for Cint n = 2; 2*n < SIZE; n++) { 
if (sieve.get(n)) { 
for Cint m=n; m*n<SIZE; m++) { 
sieve.clear(m*n) ; 
} 
i 
} 
} 


Add the following method to the Primes class and then test it: 

public static String factor(Cint n) 

// precondition: n> 1 

// returns the prime factorization of n; 

// example: factor(4840) returns "2*2*2*5*11*11" 
Christian Goldbach (1690-1764) conjectured in 1742 that every even number greater than 2 
is the sum of two primes. Write a program that tests the Goldbach conjecture for all even 
numbers less than 100. Use the Primes class from Problem 2.22. Your first 10 lines of output 
should look like this: 

4 = 242 

6 = 3+3 

8 = 345 
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10 = 34+7 = 545 

12 = 5+7 

14 = 3411 = 7+7 

16 = 3413 = 5+11 

18 = 5413 = 7+11 

20 = 3417 = 7413 

22 = 3419 = 5417 = 11+11 


Pierre de Fermat (1601-1665) conjectured that there are infinitely many prime numbers of 
the form n = 2” +1 for some integer p. These numbers are called Fermat primes. For exam- 
ple, 5 is a Fermat prime because it is a prime number and it has the form 2° +1. Write a pro- 
gram that finds all the Fermat primes that are in the range of the int type. Use the Primes 
class from Problem 2.22 and the Math. pow() method. Your first 5 lines of output should 
look like this: 


2A2A0 + 1 = 3 
2A2A1 + 1 = 5 
2A2A2 + 1 = 17 
2A2A3 + 1 = 257 
2A2A4 + 1 = 65537 


Charles Babbage (1792-1871) obtained the first government grant in history when in 1823 
he persuaded the British government to provide £1000 to build his difference engine. In his 
grant proposal, Babbage gave the formula x* + x + 41 as an example of a function that his 
computer would tabulate. This particular function was of interest to mathematicians because 
it produces an unusual number of prime numbers.Primes that have this form n = x? +x + 41 
for some integer x could be called Babbage primes. Write a program that finds all the Bab- 
bage primes that are less than 10,000. Use the Primes class from Problem 2.22. Your first 
five lines of output should look like this: 


0 41 is prime 
1 43 is prime 
2 47 is prime 
3 53 is prime 
4 61 is prime 


Two consecutive odd integers that are both prime are called twin primes. The twin primes 
conjecture is that there are infinitely many twin primes. Write a program that finds all the 
twin primes that are less than 1000. Use the Primes class from Problem 2.22. Your first five 
lines of output should look like this: 


3 5 
5 7 
11 13 
17 19 
29 31 


Test the conjecture that there is at least one prime between each pair of consecutive square 
numbers. (The square numbers are 1, 4, 9, 16, 25, . . .). Use the Primes class from Problem 
2.22. Your first five lines of output should look like this: 

1<2<4 

4<5<9 

9 < 11 < 16 

16 < 17 « 25 

25 < 29 < 36 


The Minimite friar Marin Mersenne (1588-1648) undertook in 1644 the study of numbers of 
the form n = 2? — 1, where p is a prime. He believed that most of these 7 are also primes, now 
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called Mersenne primes.Write a program that finds all the Mersenne primes for p < 30. Use 
the Primes class from Problem 2.22. Your first five lines of output should look like this: 


2 2A2-1 = 3 iS prime 

3 2A3-1 = 7 is prime 

5 2A5-1 = 31 is prime 

7 2A7-1 = 127 is prime 

11 2A11-1 = 2047 is not prime 


A number is said to be palindromic if it is invariant under reversion; that is, the number is the 
same if its digits are reversed. For example, 3456543 is palindromic. Write a program that 
checks each of the first 10,000 prime numbers and prints those that are palindromic. Use the 
Primes class from Problem 2.22. 


Answers to Review Questions 


An atray component can be any type: primitive, reference, or array. An array element is a component 
that is not itself an array type. So in a two-dimensional array al] [], the components of a[] are its row 
arrays, and the elements of a[][] are double variables. 


A multidimensional array is one that has more than one index. A Java array has only one index vari- 
able. However, since a components indexed by that variable can itself be an array (with an index), the 
original array appears to have more than one index. 


An ArrayIndexOutOfBoundsException object is an exception that gets thrown whenever a value 
less than 0 or greater than or equal to the array’s length is attempted to be used as an index on the 
array. This give the programmer some control over the consequences of such a run-time error. In lan- 
guages such as C++, such a run-time error normally causes the program to crash. 


An array index can have type byte, char, short, or int. 
The Array class cannot be instantiated because its constructor is declared private. 


The simplest way to print an array of objects is to pass it to the Arrays. toList() method which pro- 
duces a List object that can be printed directly with the System. out. printInQ method. 


The binary search will probably not work unless the sequence is sorted first. 


If the sequential search is applied to an element that occurs more than once in an array, it will return 
the index of the one that is closest to the beginning of the array. 


If the binary search is applied to an element that occurs more than once in an array, it could return the 
index of any one of them. It depends upon how close their indexes are to multiples of midpoints of 
subintervals. For example, if the binary search is applied in an array of 10,000 elements, searching for 
a value that is repeated at locations 0—99, the search would return the index 77 on the 7th iteration. 


Solutions to Problems 


public class TestFill { 
public static void main(String[] args) { 

Object[] a = new Object[4]; 
Arrays.fill(a, new DateQ)); 
ch02.ex02.DuplicatingArrays.print(a) ; 
Arrays.fill(a, 22); 
ch02.ex02.DuplicatingArrays.print(a) ; 
Arrays.fill(a, "Yo!"); 
ch02.ex02.DuplicatingArrays.print(a) ; 
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The sequential search runs in linear time, which means that the time is proportional to the number of 
elements. So an array with twice as many elements would take twice as long to process: 20 ms. 


The binary search runs in logarithmic time, so squaring the size of the problem should only double its 
running time. So an array with 10007 elements would take twice as long to process: 10 ms. 


The interpolation search runs in hyperlogarithmic time, so squaring the size of the problem should 
have no appreciable effect on its running time. So an array with 1,000,000 elements would also take 
about 2 ms to process. 


public class TestBinarySearch { 
private static final int SIZE = 10000; 
private static final int START = 0; 
private static final int RANGE = 10000; 
private static Random random = new Random(); 
private static int count = 0; 


public static void main(String[] args) { 
int[] a = new int[SIZE]; 
load(a, START, RANGE); 
Arrays.sort(a); 
search(a, random.nextInt(10000)) ; 
System.out.printInCcount + " iterations"); 


} 


public static void loadCint[] a, int start, int range) { 
for (int i = 0; i < a.length; i++) { 
afi] = start + random.nextInt(range); // random 5-digit numbers 
} 
} 


public static int searchCint[] a, int x) { 
int lo = 0; 
int hi = a. length; 
while Clo < hi) { 
++count; 
int i = Clo + hi)/2; 
if (afi] == x) { 
return i; 
} else if Cali] < x) { 
lo = i+1; 
3 else { 
hi = 1; 
} 
} 
return -1; 
} 
} 


boolean isSorted(int[] a) { 
if (a.length < 2) { 
return true; 
} 
for Cint i = 1; i < a.length; i++) { 
if cali] < af[i-1]) { 
return false; 
} 
} 


return true; 
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2.7 int minimumCint[] a) { 
int min = a[0]; 
for Cint i = 1; i < a.length; i++) { 
if cali] < min) { 
min = a[i]; 
F 
} 
return min; 


} 


2.8 int meanCint[] a) { 
double sum=0.0; 
for Cint i = 0; i < a.length; i++) { 
sum += ali]; 
return sum/a. length; 


} 


2.9 int[] withoutDuplicates(Cint[] a) { 
int n = a. length; 
if (n < 2) { 
return a; 
} 
for Cint i = 0; 1 < n-1; i++) f{ 
for Cint j = i+1; j <n; j++) f{ 
if (alj] == afi]) { 
--n: 
System.arraycopy(a, jt+tl, a, j, n-j); 


int[] aa = new int[n]; 
System.arraycopy(a, 0, aa, 0, n); 
return aa; 


} 


2.10 void reverse(int[] a) { 
int n = a. length; 
if (n < 2) { 
return; 
} 
for Cint i = 0; i < n/2; i++) { 
swap(a, 1, n-i-1); 
} 
} 
void swapCint[] a, int i, int j) { 
// swaps a[i] with a[j]: 
int ai = ali]; 
int aj = a[j]; 


ali] = aj; 
alj] = ai; 
} 
2.11 Object[] concatenate(Object[] a, Object[] b) { 


Object[] c = new Object[a. length+b. length] ; 
for Cint i = 0; i < a.length; i++) { 
cli] = ali]; 
} 
for Cint i = 0; 
c[ita. length] 
} 


i < b. length; i++) { 
= bli]; 
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return Cc; 


} 


void shuffle(Object[] a) f{ 
Random random = new Random(); 
int n = a. length; 
for Cint i = 0; i < n; i++) { 
ch02.prl10.TestReverse.swap(a,i,random.nextInt(a. length)) ; 
} 
} 


int[] tally(String s) { 

int[] frequency = new int[26]; 

for Cint i = 0; i < s.lengthQ); i++) { 
char ch = Character.toUpperCase(s.charAt(i)); 
if (Character.isLetter(ch)) { 

++frequency[Cint)ch - Cint)'A']; // count ch 
} 
return frequency; 


} 


double innerProduct(double[] x, double[] y) { 
double sum = 0.0; 
for Cint i = 0; i < x.length && i < y.length; i++) 
sum += x[i]*y[i]; 
return sum; 


} 


double[][] outerProduct(double[] x, double[] y) { 

double[][] z = new double[x. length] [y. length] ; 
for (double xi : x) { 

for (double yj : y) f{ 

zCil(j] = xi*yj; 

} 
} 
return Z; 


} 


double[][] product(double[][] x, double[][] y) { 
double[][] z = new double[x. length] [y[0]. length]; 
for Cint i = 0; i < x.length; i++) { 
for Cint j = 0; j < y[0].length; j++) { 
double sum = 0.0; 
for Cint k = 0; k < x[0].length; k++) f{ 
sum += x[i][k]*yLk][j]; 


} 
z[LiJ[j] = sum; 
} 
} 
return Z; 


} 


double[][] transpose(double[][] x) f{ 
double[][] y = new double[x[0]. length] [x. length]; 
for Cint i = 0; i < x[0].length; i++) { 
for Cint j = 0; j < x.length; j++) { 
yliJ(j] = xfjl]lid; 
} 
} 


return y; 
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2.18 int[][] pascalCint n) { 
int{J][] p = new int[n][n]; 
for Cint j = 0; j < n; j++) f{ 
pLjJ](0] = pljIlj] = 1; 
} 


for Cint i = 2; 1 <n; i++) { 


for Cint j 1; j < i; j++) { 
pLiJ([3] = pli-1](j-1] + pli-11[j]; 
} 
} 
return p; 
} 
2.19 public class TestSieve { 


private static final int SIZE=1000; 
private static boolean[] isPrime = new boolean[SIZE]; 


public static void main(String[] args) { 
initializeSieve(); 
printSieve(); 

} 


private static void initializeSieve() { 
for Cint i = 2; 1 < SIZE; i++) { 
isPrime[i] = true; 
} 
for Cint n = 2; 2*n < SIZE; n++) { 
if CisPrime[n]) { 
for (int m = n; m*n <SIZE; m++) { 
isPrime[m*n] = false; 
} 
} 
} 
} 


private static void printSieve() { 
int n=0; 
for Cint i = 0; i < SIZE; i++) { 
if CisPrime[i]) { 
System.out.printf("%5d%s", 71, ++n%16==0?"\n":""); 
} 
} 
System.out.printfC"%n%d primes less than %d%n", n, SIZE); 
} 
} 


2.20 public class TestSieve { 
private static final int SIZE=1000; 
private static Vector<Boolean> isPrime = new Vector<Boolean>(SIZE) ; 


public static void main(String[] args) { 
initializeSieve(); 
printSieve(); 

} 


private static void initializeSieve() { 
isPrime.add(false); // 0 is not prime 
isPrime.add(false); // 1 is not prime 
for Cint i = 2; 1 < SIZE; i++) { 
isPrime.add(true) ; 


i 
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for Cint n = 2; 2*n < SIZE; n++) { 
if CCisPrime.get(n))) f{ 
for Cint m =n; m*n < SIZE; m++) { 
isPrime.set(m*n, false); 
} 
} 
} 
} 


private static void printSieve() { 
int n=0; 
for Cint i = 0; i < SIZE; i++) { 
if CisPrime.get(i)) f{ 
System.out.printfC"%5d%s", 1, ++n%16==0?"\n":""); 
} 
} 
System.out.printfC"%n%d primes less than %d%n", n, SIZE); 
} 
} 


2.21 public class TestSieve { 
private static final int SIZE=1000; 
private static BitSet isPrime = new BitSet(SIZE) ; 


public static void main(String[] args) { 
initializeSieveQ); 
ch02.pr20.TestSieve.printSieve() ; 

} 


private static void initializeSieve() { 
for Cint i = 2; 1 < SIZE; i++) { 
isPrime.set(i); 
} 
for Cint n = 2; 2*n < SIZE; n++) { 
if CisPrime.get(n)) f{ 
for Cint m =n; m*n <SIZE; m++) { 
isPrime.clear(m*n) ; 
} 
} 
} 
} 


private static void printSieve() { 
int n=0; 
for Cint i = 0; i < SIZE; i++) { 
if CisPrime.get(i)) f{ 
System.out.printfC"%5d%s", 7, ++n%16==0?"\n":""); 
} 
} 
System.out.printfC"%n%d primes less than %d%n", n, SIZE); 
} 
} 


2.22 public class Primes { 
private static final int SIZE = 1000; 
private static int size = SIZE; 
private static BitSet sieve = new BitSet(size); 
private static int last = 1; 
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static { 

for Cint i=2; i<SIZE; i++) { 
sieve.set(i); 

} 

for Cint n=2; 2*n<SIZE; n++) { 
if (sieve.get(n)) { 

for Cint m=n; m*n<SIZE; m++) { 
sieve.clear(m*n); 


} 
} 
} 
} 
public static void setLast(Cint n) { 
last =n; 
} 
public static void setLastQ) { 
last = 1; 
} 
public static void setSizeCint n) { 
size =n; 
} 
public static void setSizeQ) { 
size = 1000; 
} 


public static boolean isPrimeCint n) { 
return sieve.get(n); 
} 
public static int nextQ) { 
while (++last<size) { 
if (sieve.get(last)) f{ 
return last; 
} 
} 
return -1; 
} 
public static void printPrimes() { 
int n=0; 
for Cint i=0; i<SIZE; i++) { 
if (sieve.get(i)) f{ 
System.out.print((n++%10==0?"\n":"\t")+7); 
} 
} 
System.out.printIn¢C"\n" + n + 
} 
} 


public static String factor(int n) { 
public static String factorCint n) { 
String primes=""; 
int p = next(Q); 
while (n > 1) { 
if (n%p==0) { 
primes += (primes. lengthQ)==0?"":"*") + p; 
n /= p; 
} 
else p = nextQ); 
if (p == -1) { 
primes += " OVERFLOW"; 
break; 


} 


primes less than " + SIZE); 
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} 
setLastQ); 
return primes; 


; 


public class TestGoldbach { 

public static void main(String[] args) { 
Primes.setSize(1000) ; 
System.out.printIn("4 = 242"); 
for Cint n = 6; n < 100; n += 2) { 


} 
+ 


} 


System.out.print(n); 
for Cint p = 3; p <= n/2; p += 2) { 
if (Primes.isPrime(p) && Primes.isPrime(n-p)) { 
System.out.printC" = "+p+"+"+(n-p)); 
} 
} 
System.out.printInQd; 


public class TestFermat { 
public static void main(String[] args) { 


: 


} 


public class TestBabbage { 

public static void main(String[] args) { 
Primes.setSize(1000) ; 
for Cint x = 0; x < 50; x++) f{ 


i; 


} 


P 


rimes.setSize(1000) ; 


for Cint p = 0; p < 5; p++) f{ 


} 


} 


int n = Cint)Math.pow(2,Math.pow(2,p)) + 1; 
if (Primes.isPrime(n)) { 
System.out.printIn("p = "+p+", n = 2A2Ap = "+n); 


} 


System.out.print(x); 
int n = x*x + x + 41; 
if (Primes.isPrime(n)) { 
System.out.printInC"\t"+n+" is prime"); 
3} else { 
System.out.printInQ; 
} 


public class TestTwinPrimes { 

public static void main(String[] args) { 
Primes.setSize(1000) ; 

int n = Primes.nextQ); 

while (n < 0.9*N) { 


} 


i 


} 


if (Primes.isPrime(n+2)) { 
System.out.printIn(n + "\t" + (n+2)); 

} 

n 


= primes.nextQ); 
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public class TestSquares { 
public static void main(String[] args) { 
Primes.setSize(1000) ; 
for Cint n = 1; n < 100; n++) f{ 
for Cint i = n*n+1; i < (n+1)*(n+1); i++) { 
if (Primes.isPrime(i)) { 

System.out.printf("%d < %d < %d%n", n*n, i, Cn+1)*(n+1)); 
break; 


public class TestMersenne { 
public static void main(String[] args) { 

Primes.setSize(1000) ; 

for Cint p = Primes.next(); p < 30; p = Primes.nextQ)) { 
int n = Cint)Math.round(Math.pow(2,p)) - 1; 
System.out.printfC"%d\t2A%d-1%d", p, p, n); 
if (Primes.isPrime(n)) { 

System.out.printInC" is prime "); 


3 else { 
System.out.printIn(" is not prime "); 
} 
} 
} 
} 
boolean isPalindromicCint n) { 
if (n < 0) f{ 
return false; 
} 
int p1l0=1; 
// make p10 is the greatest power of 10 that is <n 
while (p10 <n) { 
plo *= 10; 
} 
p10 /= 10; 
while (n > 9) { 
if (n/p10 != n%10) { 
return false; 
} 
n /= 10; // remove rightmost digit from n 
p10 /= 10; 
n %= p10; // remove leftmost digit from n 
} 
return true; // single digit integers are palindromic 
} 
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Arrays work well for unordered sequences, and even for ordered sequences if they don’t 
change much. But if you want to maintain an ordered list that allows quick insertions and 
deletions, you should use a linked data structure. This chapter shows how to do that. 


MAINTAINING AN ORDERED ARRAY 


Chapter 2 outlines how the binary search can find elements very quickly in an array that is 
sorted. This suggests that we should keep our arrays in sorted order. But inserting new elements 
in an ordered array is difficult. The main problem is that we have to shift all the larger elements 
forward to make room for the new element to be placed in its correct ordered position. This can 
be done by the insert() method shown in Example 3.1. 


EXAMPLE 3.1 Inserting into an Ordered Array 


void insert(int[] a, int n, int x) { 


1 

2 // preconditions: a[0O] <= ... <= a[n-1], and n < a.length; 

3 // postconditions: a[0] <= ... <= a[n], and x is among them; 

4 int i = 0; 

5 while (i < n && a[i] <= x) { 

6 +475 

7 } 

8 System.arraycopy(a, i, a, i+1, n-i); // copies a[i..n) into a[i+1..n+1) 
9 im |] = X; 


10 } 

The insert() method takes three arguments: the array a[], the number n of elements that are already 
sorted in the array, and the new element x to be inserted among them. The preconditions at line 2 specify 
that the first n elements of the array are in ascending order and that the array has room for at least one 
more element. The postconditions at line 3 specify that the array is still in ascending order and that x has 
been successfully inserted among them. 

The code at lines 4—7 searches the array for the correct position for x to be inserted. It should be the 
smallest index i for which a[i] > x. For example, if x = 50 for the array shown in Figure 3.1, then the 
correct position for x is at index i = 1, because a[0] <= x < a[1]. 

After the correct position i has been located for x, the insert() method shifts the elements that are 
greater than x one position to the right. This is accomplished by the call 
System.arraycopy(a, i, a, i+1, n-i); 
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Figure 3.1 Making room for the new element 


at line 8. The arraycopy() method is a static method in the System class. It is usually the most 
efficient way to copy elements between arrays or within a single array. Its five arguments are: the source 
array, the index of the first element to be copied from the source array, the destination array, the index in 
the destination array where the first element is to be copied, and the number of elements to be copied. If n 
= 4 and i = 1, as shown in Figure 3.1, then the call is 
System.arraycopy(a, 1, a, 2, 3); 
This shifts elements {a[1], a[2], a[3]} = {66, 88, 99} into elements {a[2], a[3], a[4]}. 
Finally, x is inserted into a[i] at line 9, as shown in Figure 3.2. 


* | 50 
a[1] = 50 va 
0 1 2 3 4 
2/33 | 50 66 88 99 ? ? ? 


Figure 3.2 Copying x into its correct position 


The insert() method may have to move a lot of data. For example, if n = 1000 and x is less 
than all of those elements, then the method will move all 1000 elements. On average, inserting 
into a sorted array of elements will move 7/2 elements. So this is a O(7) operation. 

Deleting an element is simply the reverse of the insertion process. It too will have to move n/2 
elements, on average. So deletion is also a @(n) operation. 


INDIRECT REFERENCE 


One solution to the data movement problem that is intrinsic to dynamic ordered arrays is to 
use an auxiliary index array to keep track of where the elements actually are. This solution 
requires more space (a second array) and makes the code a bit more complicated. But it elimi- 
nates the need to move the elements. It allows the elements to be stored at an arbitrary position in 
the array, using the auxiliary index array to locate them for ordered access. 

The main idea is shown in Figure 3.3. The elements {22, 33, 44, 55, 66} are kept in arbitrary 
positions in the array a[], and their order is determined by some auxiliary mechanism. 
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Figure 3.3 Referring to the order of the array elements 


Each element is kept in a numbered component: 22 is in component 3, 33 is in component 5, 
44 is in component 1, and so on. So if we save the order of the index numbers (3, 5, 1, 4, 6), then 
we can access the elements in order: a[3] followed by a[5] followed by a[1], and so forth. 

An index array is an array whose elements are index values for another array. By storing the 
index numbers 3, 5, 1, 4, 6 in an index array k[] (shown in Figure 3.4), we can use them to 
access the data elements 22, 33, 44, 55, 66 in order. 


Figure 3.4 Using an index array 


Now this may be an improvement, but it is not optimal. The reason we wanted to allow the 
element to be stored in arbitrary positions in the first place was to simplify the insertion and 
deletion operations. We wanted to avoid having to shift segments of a[] back and forth. But the 
solution shown in Figure 3.4 merely transfers that obligation from a[] to k[]. If we had to insert 
the element 50, we could put it at position a[0] or a[2] or any place after a[6], but we would 
then have to insert its index into the index array k[] between k[2] and k[3] to keep track of the 
order of the elements. 

A better solution is to use the same array positions in the index array k[] as we are using in 
the data array a[]. Since the index array is keeping track of the correct order of the index 
numbers of the data elements, it can be used to do the same for the index numbers themselves. 


am Le; 44 mee Se eae: ? 
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‘ Ee > Ee 2 | 2 [2 free| 7 


Figure 3.5 Using an index array 
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In Figure 3.5, the index array k[] keeps track of the order of the elements in a[]. The starting 
index 3 is stored in k[0]. That begins the chain of indexes: k[0] = 3, k[3] = 5, k[5] = 1, 
k[1] =4, k[4] = 6, k[6] = 0. The index 0 signals the end of the ordered sequence. The index 
sequence 0, 3, 5, 1, 4, 6 gives us the data elements in order: a[3] = 22, a[5] = 33, a[1] = 44, 
a[4] =55, a[6] = 66. 

The extra variable free, shown Figure 3.5, saves the index of a free location in both the index 
array k[] and the data array a[]. The value 7 means that k[7] and a[7] should be used next. 

The implementation of an index array solves the problem of having to shift segments of array 
elements back and forth during deletions and insertions. For example, to insert x = 50 in Figure 
3.5, we first traverse the sequence to find the index i of the largest element that is less than x: 
i = 1. Then just follow these three steps: 


a[free] = x; // put x into the next free position 
k[free] = k[i]; // store the next index in that position in k[] 
k{Ci] = free++4; // store the index of x in k[] and increment free 


The results are shown in Figure 3.6. 


x 


(2h 


0 1 2 3 4 5 6 7 8 9 
| BE | |: | IE free| 8 


Figure 3.6 Inserting an element 


The Java code for this algorithm is shown in Example 3.2. This improves the insertQ 
method shown in Example 3.2, because its only data movement is the actual insertion of x into 
the array a[] at line 5. 


EXAMPLE 3.2 Inserting into an Array that Is Indirectly Ordered 


void insertCint x) { 

int i=0; 

while Ck[i] != 0 && a[k[i]] < x) f{ 
i= k{il; 


1 
2 
3 
4 
5 } 
6 a[free] = x; 
7 k{free] = k[i]; 
8 k{i] = free++; 

9 t 

The while loop at lines 3—5 is similar to the while loop at lines 5—7 in Example 3.1 on page 46: it 
finds the first index i for which a[k[i]] > x. At line 6, x is inserted in the next free location in the array 
a[]. At line 7, the index of the next location after x is stored in k[free]. At line 8, the index of x is 
copied into k[i], and then free is incremented to the index of the next free location. 

Note that this code assumes that the array is large enough to accommodate all elements that might be 


inserted. In practice, we would probably include a resize() method. 
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LINKED NODES 


The values of the index array k[] in Figure 3.6 are used as locators, addressing the actual data 
array a[]. We don’t really need a separate array for them. Their relative positions in the index 
array match the positions of the corresponding data elements. So we can combine them into a 
single array of data-address pairs, as shown in Figure 3.7: 


4 5 6 7 
a elastase Teles ti estoy? 


Figure 3.7 Storing the indexes with their elements in the same array 


In this version, the array a[] would be defined as shown in Example 3.7. 
Node[] a = new Node[size]; 
where Node would now be a separate class, defined like this: 
class Node { 
int data; 
int next; 
} 
This makes the array a[] a little more complex, but it eliminates the need for an auxiliary array 
altogether. 

Fortunately, Java allows an even better solution, one that allows us to eliminate both arrays! 
Taking an object-oriented point of view, we see in Figure 3.8 a sequence of Node objects. Each 
object contains a data element and the address of the next object in the sequence. In Java, objects 
are directly accessed by their addresses. That’s what an object reference is: the address of where 
the object is stored in memory. So by reinterpreting the meaning of “address,” as a memory 
address (1.e., object reference) instead of an array index, we can simplify the structure to the one 
shown in Figure 3.8. Here, the arrows represent object references (i.e., memory addresses). 


start | 


44 22 | 55 |@| | 33 66 |e 


Figure 3.8 Using objects for the elements and their references 


Now, instead of an array aL], we need only keep track of the single start reference. The Java 
runtime system does all the rest of the bookkeeping. The code is given in Example 3.3. 


EXAMPLE 3.3 A Node Class 


data [22] next [e—+—> 


1 class Node { Node 
int data; 


Node next; Figure 3.9 A Node object 
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Node(int data) { 
this.data = data; 


} 


N On 


8 } 
Notice that the Node class is now self-referential: Its next field is declared to have type Node. 
Each Node object contains a field that is a reference to a Node object. 

The other field in the Node class is its data field, declared at line 2 here to be an int. Of 
course in general this field could be any type we want—whatever type values we have to store in 
the list. 

The Node class in Example 3.3 also includes a one-argument constructor, at line 5. Note that, 
since we have explicitly defined a constructor that takes at least one argument, the compiler will 
not implicitly define a no-argument constructor. Therefore, since we have vot explicitly defined a 
no-argument constructor, none will exist. That means that the only way a Node object can be 
created is with the one-argument constructor (at line 5); that is, we must provide a data value for 
each new Node object that we create. 

Figure 3.9 shows a typical Node object. Its data field contains the integer 22, and its next 
field contains a reference to another Node object (not shown). Although it is common to use an 
arrow like this to represent an object reference, it is good to keep in mind that the actual value of 
the reference is the memory address of the object to which it refers. In other programming 
languages, such variables are called pointers; hence their common depiction as arrows. 

Recall that in Java each reference variable either locates an 
object or is null. The value nul] means that the variable does Wana ce 
not refer to any object. The memory address that is stored in a Node 
null reference variable is 0x0 (the hexadecimal value 0); no 
object is ever stored at that address. Figure 3.10 shows a Node 
object whose next field is nul]. 

Example 3.4 shows how the five-element list could be built. start a 


next |e 


Figure 3.10 Another Node object 


EXAMPLE 3.4 Constructing a Linked List 37 Te 


Node start = new Node(22); 
start.next = new Node(33); Figure 3.11 Initializing start 
start.next.next = new Node(44); 
start.next.next.next = new Node(55); 
start.next.next.next.next = new Node(66); 
At line 1, we create a node containing the data value 22 and initialize our start variable to it. The 
result is shown in Figure 3.11. Note that the start variable is merely a reference to the Node object. Also 
note that the next reference in the Node object is nu11, indicated by the black dot with no arrow emanat- 
ing from it. The node’s next field is nul1 because the constructor (defined at line 5 in Example 3.3 on 
page 50) does not initialize it. In Java, every class field that is an object reference (i.e., its type is either a 
class or an interface) is automatically initialized to nu11, unless it is initialized by its constructor to some 
existing object. 
In the figures that follow, each Node object is shown as a box with start 
two parts: the left side contains the integer data, and the right side 
contains the next reference. This simply abbreviates the versions WD 
shown in Figure 3.9. 
Continuing the code in Example 3.4, at line 2, the start node’s 33 
next field is assigned to a new Node object containing the data 33. 
Now the list has two nodes, as shown in Figure 3.12. Figure 3.12 Adding a node 


a fF wowN = 
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The next node is added to the end of the list at line 3. To do that, we have to assign it to the next field 
of the node that contains 33. But the only node to which we have external access (i.e., the only node that 
has a variable name) is the first node. Its name is start. So we have to use the expression 
start.next.next to refer to the next field of the node that contains 33. 

Similarly, the fourth node is added at line 4 using start 
the expression start.next.next.next, and the 
fifth node is added at line 5 using the expression 0D \ 


start.next.next.next.next. That finally gives 
us the five-node list shown in Figure 3.13. a 


The code in Example 3.4 is clumsy and a4 
unsuited for generalization. Obviously, if we \ 
wanted to build a linked list of 50 nodes, this BE { 


approach would be unworkable. The solution is 
to use a local reference variable that can “walk 66 le 
through” the list, locating one node after the 
other and thereby giving local access to the 
nodes. Traditionally, the variable p (for “pointer’’) is used for this purpose. Since it will refer to 
individual nodes, it should be declared to be a Node reference, like this: 
Node p; 
And since our only access to the nodes is from the start node, we should initialize p like this: 
Node p=start; 
This is shown in Figure 3.14. Then the assignment 
p = p.next; 
will advance the locator variable p to the next node, as shown in Figure 3.15. This same assign- 
ment can thus be executed as many times as is needed to advance through the linked list. 


can ce 
p —< 


an ae 


Figure 3.13 The five-node list 
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Figure 3.14 Initializing p at the start node Figure 3.15 Advancing p to the second node 


Example 3.5 shows how we could have used this technique to build the linked list in the first 
place. 


EXAMPLE 3.5 Constructing a Linked List 


1 start = new Node(22); 
2 Node p=start; 

3 p.next = new Node(33); 
4 p = p.next; 
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.next = new Node(44); 
= p.next; 
-next = new Node(55); 
= p.next; 
p.next = new Node(66) ; 
This code may not seem much better than the other version in Example 3.4. But one big advange is that 
it is easily managed within a loop. For example, the same list can be built with the three lines of code in 
Example 3.6. 


oO oN DH 


EXAMPLE 3.6 Using a for Loop 


1 Node start = new Node(22), p = start; 
2 for Cint i=0; 1<4; i++) { 

3 p = p.next = new Node(33+11*71); 

4 


} 


Obviously, this form could just as easily build a linked list of 50 nodes. Each step in the execution of 
this code is shown in Figure 3.16.The reference variable p is analogous to an array index i: It 
advances through the nodes of a linked list just as i advances through the elements of an array. 
Consequently, it is natural to use p in a for loop, just as we would use the array index i. For 
example, compare Example 3.7 with Example 3.8. 


start ‘i 


p = start = new Node(22) 22 |e 
start p 

p = p.next = new Node(33) 22 |e-—>|_ 33 |e 
— p 

p = p.next = new Node(44) 22 [o+-——| 33 [o+—-| 44 [eo 
_ p| 

p = p.next = new Node(55) 22 [e-—e|[ 33 [o+—e-| 44 [o> | 55 [eo 

\ 33 * p| 
p = p.next = new Node(66) 22 / 44 [o+——e| 55 [eo >| 66 [eo 


Figure 3.16 Trace of Example 3.6 
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EXAMPLE 3.7 Using a for Loop to Print a Linked List 


1 for (Node p = start; p != null; p = p.next) { 
2 System.out.printIn(p.data) ; 
3 } 
EXAMPLE 3.8 Using a for Loop to Print an Array 
1 for Cint i=0; i < n; i++) { 
2 System.out.printIn(a[i]); 


3 
In both listings, the for loop prints one element on each iteration. The for statement has a three-part 
control mechanism. The first part declares the control variable (p for the list, i for the array) and initializes 
it to the first element: 
Node p=start 
int i=0 
The second part gives the continuation condition, asserting that there are more elements: 
p != null 
i<n 
The third part gives the update expression, advancing the control variable to the next element: 
p = p.next 
i++ 
In each of these parts, the two versions are analogous. 


Example 3.9 shows a test driver for a simple external Node class. 


EXAMPLE 3.9 Testing the Node Class 


1 public class TestNode { 
2 public static void main(String[] args) { 
3 Node start = new Node(22); 
4 Node p = start; 
5 for Cint i = 1; i < 5; i++) { 
6 p = p.next = new Node(22 + 11*1); 
7 } 
8 for (p = start; p != null; p = p.next) { 
9 System.out.printIn(p.data) ; 
10 } 
1 for (p = start; p != null; p = p.next) { 
12 System.out.printIn(p) ; 
13 } 
14 } 
15 } 
16 
17 class Node { 
18 int data; 
19 Node next; 
20 Node(int data) { 
21 this.data = data; 
22 } 
23 } 
The output is: 
22 
33 
44 
55 
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Node@7182c1 
Node@3f5d07 
Node@f4a24a 
Node@cac268 
Node@a16869 
The first node is constructed at line 3. Then the for loop at lines 5—7 constructs the other four nodes. 
The second for loop at lines 8—10 prints the node data in the first five lines of output. The third for loop 
at lines 11-13 gives the actual memory addresses of the five Node objects. 
When you use an object reference like p in a string expression such as 
System.out.printIn(p); 
the system automatically invokes that object’s toString() method. Unless it has been overridden, the 
version of the toString() method that is defined in the Object class will execute, as it did in the 
program in Example 3.9. The string returned by that version merely contains the object’s type (Node) 
followed by the @ sign and the memory address of the object (7182c1). So the last five lines of output 
report that the five Node objects are stored at the (hexadecimal) memory addresses 0x7182c1, 0x3f5d07, 
Oxf4a24a, Oxcac268, and 0xal6869. These then are the actual values stored in the reference variables 
start, start.next, start.next.next, start.next.next.next, and start.next.next.next.next. 
You can see from Figure 3.17 why we usually draw linked 


lists using arrows to represent the Node references. Showing start | 0x7182c1 
the actual memory address values instead requires more effort 
to see which node references which. Moreover, those memory 0x7182¢1 
address values are runtime dependent: They will be different data [22] next! Ox3t5d07 
on different computers, and maybe even on the same computer Node 
at different times. 

One final note: At line 6 we use the chained assignment Ox3F5d07 

p = p.next = new Node(22+11*i); data [33] next | Oxf4a24a | 

It is important to remember the order of operations in such a Node 
statement. Here, the first thing that happens is the evaluation 
of the expression 22 + 11*i. When 7 is 1, that evaluates to 33; Oxf4a24a 
when 7 is 4, it evaluates to 66. After the value is obtained, it is Babe 44 [emexe] Oxcac2zo5 
passed to the Node class constructor at line 5 of Example 3.3 oe 
on page 50. That constructs a node with that value in its data Gecacabe 


field and null in its next field. The constructor returns a Gata 55 | next Oxalesed 
reference to the Node object. It is that reference that is 


assigned first to p.next, and then to p. The key is that the or 

assignments are made from right to left. So we know that p is 0xa16869 

not updated until after its next field is. So, first the next field data [66] next 0x0 
is set to point to the new node, and then the loop control Node 


variable p is advanced to that next node. . : 
Figure 3.17 The five Node objects 


INSERTING AN ELEMENT INTO A LINKED LIST 


Recall how new elements were inserted into the linked list that was built in Figure 3.16 on 
page 53. To simplify the process, we add a two-argument constructor to our Node class, as shown 
in Example 3.10 on page 56. This allows us to create the node and insert it all at once. 

Figure 3.18 illustrates the invocation of the two-argument Node constructor. It shows next as 
a reference to a Node object and x as an int with value 50. Passing these two arguments to the 
constructor creates a new Node object that contains 50 and whose next field points to the same 
object that the given next pointer points to. The constructor then returns a reference to the new 
Node object, which is assigned to q. 
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The code for inserting an element into a nonempty linked list is given in Example 3.11. To 
appreciate its simplicity, compare it with the equivalent method in Example 3.2 on page 49. 


EXAMPLE 3.10 A Node Class with Two Constructors 


1 class Node { 
2 int data; 
3 Node next; 
4 
5 Node(int data) { 
6 this.data = data; 
7 } 
8 
9 NodeC(int data, Node next) { 
10 this.data = data; 
"1 this.next = next; 
12 } 
13 } 
nextle+-—»| 7? |e nextle+-—e| ? |e 
q = new Nodle(x.next) > Fé 
x | 50 q[e++—e| 50 


Figure 3.18 Invoking the two-argument Node constructor 


The insertion has two steps: (1) find the list node p that should precede the new node; (2) create and 
attach the new node. 


EXAMPLE 3.11 Inserting into a Nonempty Sorted Linked List of Integers 


1 void insert(Node start, int x) f{ 

2 // PRECONDITIONS: the list is in ascending order, and x > start.data; 
3 // POSTCONDITIONS: the list is in ascending order, and it contains x; 
4 Node p = start; 

5 while (p.next != null) { 

6 if (p.next.data > x) break; 

7 p = p.next; 

8 

9 p.next = new Node(x,p.next); 


10 } 

The first step is done by the loop at lines 5—8. The variable p is declared at line 4 to be a reference to 
Node objects. It is initialized to point to the start node, which contains 22 in Figure 3.19. The loop control 
condition (p.next != nu11) at line 5 will allow the loop to iterate until p points to the last element in the 
list. At that point, p.next will be nu11, stopping the loop. But inside the loop, at line 6, the condition 
(p.next.data > x) will stop the loop prematurely, before p reaches any nodes that should come after the 
new node. This is how the list remains in ascending order: New elements are always inserted between the 
elements that are less than it and those that are greater than it. 

The assignment p = p.next at line 7 is the standard mechanism for traversing a linked list. On each 
iteration of the while loop, this assignment moves p to point to the next node in the list. 

The actual insertion is done by the statement at line 9. The expression new Node(x,p.next) creates the 
new node and initializes its two fields, as we saw previously in Figure 3.18. In that version, it assigned the 
new node’s reference to q. The statement at line 7 assigns it to p.next instead. This changes the next 
pointer of the p node (the node containing 44): it was pointing to the node containing 55; now it points to 
the new node that contains 50. 
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start jo—e| 22 |e-—e| 33 |e} —|_ 44 [0 — | 55 [0+ — | 66 [e 


p.next = new Node(50,p.next) 


start |e——e| 22 |e+-—e-| 33 oa 55 |je-——e| 66 |e 
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Figure 3.19 Inserting into a nonempty sorted linked list 
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Figure 3.20 Inserting the new node in three steps 


This second stage of the insertion could be done by start [e+—e| 22 je-——>| 33 |e} 
several separate statements, like this: 
Node q = new Node(x); 


q.next = p.next; insert(start, 20) 

p.next = q; 
These separate steps are illustrated Figure 3.20. Once 
we understand this process, we might as well use the start [e}+—e[ 22 33 |eo}—> 
power of Java and write it in the single statement \ if 

p.next = new Node(x, p.next); 0 


without the clutter of the extra variable q. Rise 9-91 adsertind 20ineortecdly 
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INSERTING AT THE FRONT OF THE LIST 


The insert() method in Example 3.11 on page start [e}+—e[ 22 [e+—[ 33 [e+-—> 
56 includes the extra precondition that x be greater 
than the first element in the list (start.data). To 
see why that precondition is needed, look at what 
the method would do if x were 20 instead of 50. In 


that case, the break condition at line 6 would be start 22 [e+—»| 33 [e-—_> 
20 |é 


insert(start, 20) 


true on the first iteration of the whi 1e loop, leaving 
p pointing at the start node when the new node 
gets inserted at line 9. The result, as shown in 
Figure 3.21 on page 57, is that 20 gets inserted Figure 3.22 Inserting 20 correctly 
between 22 and 33, instead of where it belongs at 

the front of the list. The problem is that we lack a node to precede the new one. 

One way to solve this problem is to restructure the linked list itself so that it maintains a 
“dummy” head node that precedes the first real data node. This uses a little extra space, but it 
allows the insert() method in Example 3.11 to work for all cases. 

The other solution is to modify the insert() method in Example 3.11 so that it handles this 
special case separately. This is done in Example 3.12 and illustrated in Figure 3.22. There are 
two situations in which the insert should be done at the front of the list: if the list is empty or if 
the new element is less than the first element of the list. Both conditions are handled at line 4. In 
the first case, we could simply reset start to a new node containing x, like this: 

start = new Node(x); 
using the one-arg constructor. In the second case, we also have to assign the new node to start, 
but we also have to connect it to the rest of the list. But the only reference we have to the begin- 
ning of the list is start itself, so we would have to hold that reference in a temporary variable 
before reassigning start to the new node. 


EXAMPLE 3.12 Linked List Insertion 


Node insert(Node start, int x) { 


1 

2 // precondition: the list is in ascending order; 
3 // postconditions: the list is in ascending order, and it contains x; 
4 if (start == null || start.data > x) { 

5 start = new Node(x,start); 

6 return start; 

7 } 

8 Node p=start; 

9 while (p.next != null) { 

10 if (p.next.data > x) break; 

TT p = p.next; 

12 } 

13 p.next = new Node(x,p.next); 

14 return start; 

15 } 


Using the two-argument constructor obviates the need for that extra temporary assignment: 
start = new Node(x,start); 
Moreover, it also handles the first case, where the list was empty, because in that case, start is 
null, and passing nu11 to the second parameter is equivalent to using the one-arg constructor: 
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start = new Node(x, null); // equivalent 
start = new Node(x); // equivalent 
So once again, the two-argument constructor provides the best solution. 
Note that unlike the simpler version in Example 3.11, the complete insert() method in 
Example 3.12 has to return the start node reference, because that reference may be changed at 
line 5. 


DELETING FROM A SORTED LINKED LIST 


Implementing an ordered list with a linked structure makes insertion far more efficient 
because it eliminates the need to shift elements. The same is true for deletion. 

Like the insert() method, the delete() method has two main parts: (1) find the element; 
(2) delete it. It also handles the special case at the front of the list separately. Example 3.13 shows 
the delete() method. 


EXAMPLE 3.13 Linked List Deletion 


1 Node delete(Node start, int x) { 

2 // precondition: the list is in ascending order; 

3 // postconditions: the list is in ascending order, and if it did 
4 // contains x, then the first occurrence of x has been deleted; 
5 if (start == null || start.data > x) { // x is not in the list 
6 return start; 

7 } else if (start.data == x) { // X is the first element in the list 
8 return start.next; 

9 } 

10 for (Node p = start; p.next != null; p = p.next) f{ 

1 if (p.next.data > x) f{ 

12 break; // X iS not in the list 

13 } else if (p.next.data == x) { // X is in the p.next node 

14 p.next = p.next.next; // delete it 

15 break; 

16 } 

17 } 

18 return start; 


19 } 

If the list is empty, then start == nul11 and nothing has to be done. Also, if the first element is greater 
than x, then since the list is sorted, all the elements must be greater than x, so x is not in the list. Both of 
these cases are handled first at line 5. 

If the first element in the list equals x, then it is deleted at line 8. This is done by returning start.next 
to start, as shown in Figure 3.23. If no other reference is pointing to the original start node, then it will 
be deleted by the Java “garbage collector.” 

If the first element of the list is less than x, then the for loop at line 10 searches for the first element 
that is greater than or equal to x. If it finds one greater, then the method breaks at line 12 and returns 
without changing the list. If it finds an element equal to x, then it deletes it at line 14. This is illustrated in 
Figure 3.24. 


NESTED CLASSES 


In Java, a class member may be a field, a constructor, a method, an interface, or another class. 
A class that is a member of another class is called a nested class. 
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start [e-—e|_22 |e-—>| 33 |e-— 


delete(start, 22) 


start 22 |e—e| 33 [eo — 


garbage collected 


start [e-—————| 33 [9 — 


Figure 3.23 Deleting the first element from a sorted linked list 


start [e+-—| 22 |e+—e| 33 [e+ 44 [e+ | 55 (0+ | 66 [o 


delete(start, 55) 


start [e+—e| 22 [e+—»| 33 [e+ 44 55 |e—-| 66 |e 


garbage collected 


start [e}+—e| 22 [e+—e| 33 [e+e | 44 [o> 66 [eo 


Figure 3.24 Deleting any other element from a sorted linked list 


If the only place where a class Y will be used is within another class X, then class Y should be 
nested within class x. This is an important example of the information hiding principle that we 
have applied in other contexts. 


If X is any type (class or interface) and Y is any other type nested within X, then every member 
of X is accessible from Y and every member of Y is accessible from X. This is illustrated in 
Example 3.14. 

The Main class in Example 3.14 has a private nested class named Nested. Both classes have 
a private int field. Main declares and initializes m at line 2; Nested declares and initializes n at 
line 15. The Nested class also defines a private method f() at line 17. 
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EXAMPLE 3.14 Accessibility from Nested Classes 


1 public class Main { 
2 private int m = 22; 
3 
4 public Main() { 
5 Nested nested = new Nested(); 
6 System.out.printInC"Outside of Nested; nested.n = " + nested.n); 
7 nested. fQ; 
8 } 
9 
10 public static void main(String[] args) { 
1 new MainQ(); 
12 } 
13 
14 private class Nested { 
15 private int n = 44; 
16 
17 private void fO { 
18 System.out.printInC"Inside of Nested; m= " +m); 
19 } 
20 } 
21 } 
The output is: 


Outside of Nested; nested.n = 44 
Inside of Nested; m = 22 


The main() method invokes the Main() constructor at line 11. That instantiates the Nested class at 
line 5. The private field n of the Nested class is accessed at line 6, and the private method FQ of the 
Nested class is accessed at line 7. This shows that private members of a nested class are accessible from 
its enclosing class. Symmetrically, the private members of the enclosing class are accessible from 
within its nested class, as demonstrated by line 18. 


The UML symbol for the 
nesting of one class inside Main ~e——,__ Nested 
another uses a circle with a TS (SSE 
plus sign inside in place of 
the arrowhead, as shown in 
Figure 3.25. 

Since all members of a 
private nested class are still Figure 3.25 UML diagram for a nested class 
accessible from anywhere 
else in the enclosing class, 
those members are usually declared without any access modifier (private, protected, or 
public), for simplicity. 

Normally, a nested class should be declared static unless its instances need to access 
nonstatic members of its enclosing class. (A nested class that is nonstatic is called an inner 
class.) 


+ Main() - fO 
+ main(String[]) 


The Node class defined in Example 3.10 on page 56 is used only within the context of the 
linked lists that are being implemented. So it should be nested inside its List class. Moreover, 
since nodes have no need to access List methods or fields, the Node class should be declared as 
a static nested class. This is done at line 12 in Example 3.15 and is illustrated in Figure 3.26. 
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EXAMPLE 3.15 Nesting the Node Class within a LinkedList Class 


1 
2 
3 
4 
5 
6 
if 
8 
9 


public class LinkedList { 
private Node start; 


public void insertCint x) { 
// Insert lines 2-14 of Example 3.12 on page 58 
} 


public void deleteCint x) { 
// Insert lines 2-18 of Example 3.13 on page 59 
} 


private static class Node { 
// Insert lines 2-12 of Example 3.10 on page 56 
} 
} 


Hiding the Node class within the LinkedList class encapsulates the LinkedList class, making it self- 
contained and concealing its implementation details. A developer could change the implementation 
without having to modify any code outside of that class. 


3.1 
3.2 
3.3 
3.4 


3.5 


3.1 


LinkedList -)}—_————————_ Node 
- start: Node data: int 
next: Node 


+ insert (Node, int) 
+ insert(Node, int) Node (int) 


Node(Cint, Node) 


Figure 3.26 A Node class nested within a LinkedList class 


Review Questions 


Why is an array such an inefficient data structure for a dynamic sorted list? 
What is an index array? 
If linked lists are so much better than arrays, why are arrays used at all? 


Why does insertion at the front of a linked list have to be done differently from insertion else- 
where? 


Why are the lists backwards in the BigInt class? 


Problems 


Write and test this method, similar to the insert() method in Example 3.1 on page 46: 
void delete(int[] a, int n, int x) 
// precondition: 0 <= n < a.length; 
// postconditions: the first occurrence of x among 
{a[0O], ..., a[n-1]} has been deleted; 
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3.2 


3.3 


3.4 


3.5 


3.6 


3.7 


3.8 


3.9 


For example, if a[] is the array {33, 55, 77, 99, 77, 55, 33, 0}, then delete(a, 6, 55) will 
change a[] to {33, 77, 99, 77, 55, 33, 0, 0}. 


Write and test this method: 

int size(Node list) 

// returns: the number of nodes in the specified list; 
For example, if list is {33, 55, 77, 99}, then size(list) will return 4. 


Write and test this method: 

int sum(Node list) 

// returns: the sum of the integers in the specified list; 
For example, if list is {25, 45, 65, 85}, then sum(list) will return 220. 


Write and test this method: 

void removeLast(Node list) 

// precondition: the specified list has at least two nodes; 

// postcondition: the last node in the list has been deleted; 
For example, if list is {22, 44, 66, 88}, then removeLast(list) will change it to {22, 44, 
66}. 


Write and test this method: 

Node copy(Node list) 

// returns: a new list that is a duplicate of the specified list; 
Note that the new list must be completely independent of the specified list. Changing one list 
should have no effect upon the other. 


Write and test this method: 

Node sublist(Node list, int p, int q) 

// returns: a new list that contains copies of the q-p nodes of the 

// specified list, starting with node number p (starting with 0); 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then sublist(list, 2, 7) will 
return the new list {44, 55, 66, 77, 88}. Note that the two lists must be completely indepen- 
dent of each other. Changing one list should have no effect upon the other. 


Write and test this method: 

void append(Node list1, Node list2) 

// precondition: listl has at least one node; 

// postcondition: listl has list2 appended to it; 
For example, if listl is {22, 33, 44, 55} and list2 is {66, 77, 88, 99}, then 
append(list1, list2) will change list1 to {22, 33, 44, 55, 44, 55, 66, 77, 88}. Note that 
no new nodes are created by this method. 


Write and test this method: 
Node concat(Node list1, Node list2) 
// returns: a new list that contains a copy of list1l, followed by 
// a copy of list2; 
For example, if listl is {22, 33, 44, 55} and list2 is {66, 77, 88, 99}, then 
concat(list1, list2) will return the new list {22, 33, 44, 55, 44, 55, 66, 77, 88}. Note 
that the three lists should be completely independent of each other. Changing one list should 
have no effect upon the others. 


Write and test this method: 

void set(Node list, int i, int x) 

// replaces the value of element number i with x; 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then set(list, 2, 50) will change 
list to {22, 33, 50, 55, 66, 44, 88, 99}. 
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Write and test this method: 
int get(Node list, int 7) 
// returns the value of element number i; 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then get(list, 2) will return 44. 


Write and test this method: 

void put(Node list, int i, int x) 

// inserts x as element number 7; 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then put(list, 3, 50) will change 
list to {22, 33, 44, 50, 55, 66, 44, 88, 99}. Hint: if i = 0, replace the value of the first node 
with x, and insert a new node immediately after it that contains the previous fist value. 


Write and test this method: 

void swap(Node list, int i, int j) 

// swaps the ith element with the jth element; 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then swap(list, 2, 5) will change 
list to {22, 33, 77, 55, 66, 44, 88, 99}. 


Write and test this method: 

Node merged(Node list1, Node list2) 

// precondition: listl and list2 are both in ascending order; 

// returns: a new list that contains all the elements of listl and 

// list2 in ascending order; 
For example, if list1 is {22, 33, 55, 88} and list2 is {44, 66, 77, 99}, then 
merged(list1, list2) will return the new list {22, 33, 44, 55, 66, 77, 88, 99}. Note that 
the three lists should be completely independent of each other. Changing one list should have 
no effect upon the others. 


Write and test this method: 

void rotateLeft(Node list) 

// moves the first element of the specified list to its end; 
For example, if list is {22, 33, 44, 55, 66, 77, 88, 99}, then rotateLeft(list) will 
change list to {33, 44, 55, 66, 77, 88, 99, 22}. Note that no new nodes are created by this 
method. 


Answers to Review Questions 


Arrays are inefficient for implementing dynamic sorted lists because the insert and delete operations 
require moving half the elements, on average. 


An index array is an array whose elements are index values into another array. 


Linked lists provide no direct access. To find the 100th element, you have to move sequentially 
through the first 99 elements. 


Insertion at the front of a linked list has to be done differently because the link to the new node is the 
start link; it is not p.next for any node p (unless you use a dummy head node). 


We had to define the linked lists backwards in the BigInt class because the digits of an integer are 
processed from right to left in the common arithmetic operations. 
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Solutions to Problems 


3.1 void delete(int[] a, int n, int x) { 
int i= 0; // find the first index i for which a[i] > x: 
while (i < n && a[i] <= x) { 
+413 
} 
if Gi < n-1) { 
System.arraycopy(a, i, a, i-1, n-i); 


} 
a[n-1] = 0; 
} 
3.2 int size(Node list) { 


int count = 0; 

while (list != null) { 
++count; 
list = list.next; 

} 

return count; 


} 


3.3 int sum(Node list) { 
int sum = 0; 
while (list != null) { 
sum += list.data; 
list = list.next; 
} 
return sum; 


} 


3.4 void removeLast(Node list) { 

if Cliist == null || Jist.next == null) { 
throw new IllegalStateExceptionQ ; 

} 

while (list.next.next != null) { 
list = list.next; 

} 

Jist.next = null; 


- 


3.5 Node copy(Node list) { 

if (list == null) { 
return null; 

} 

Node clone = new Node(list.data) ; 

for (Node p=list, q=clone; p.next != null; p=p.next, q=q.next) { 
q.next = new Node(p.next.data) ; 

} 

return clone; 


} 


3.6 Node sublist(Node list, int p, int q) { 
if (m<0 || n<~m) { 
throw new IllegalArgumentExceptionQ() ; 
} else if (n == m) { 
return null; 
} 
for Cint i=0; i<m; i++) { 
list = list.next; 


} 
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Node clone = new Node(list.data) ; 
Node p=list, q=clone; 
for Cint i=m+1; i<n; i++) { 
if (p.next == null) { 
throw new I]1legalArgumentException() ; 


.next = new Node(p.next.data) ; 
= p.next; 
q.next; 


AT QS 


} 


return clone; 


void append(Node list1, Node list2) { 
if Clistl == null) { 


throw new IllegalStateExceptionQ ; 


} 
while (listl.next != null) { 


Jistl = listl.next; 


} 
Jistl.next = list2; 


i 


Node concat(Node list1, Node list2) { 
Node list3 = new Node(0), p=list1l, q=list3; 
while (p != null) { 


q.next = new Node(p.data) ; 


p = p.next; 
q = q.next; 

} 

p = list2; 


while (p != null) { 


q.next = new Node(p.data) ; 
p = p.next; 
q = q.next; 


} 


return list3.next; // discard dummy head node 


} 


voi 
1 


d set(Node list, int i, int x) { 
f dG <0) f{ 
throw new I1legalArgumentException() ; 


} 
for Cint j=0; j<i; j++) f{ 


a 
1 


r 


} 

int 
/ 
N 
1 
w 


} 


if Clist == null) { 
throw new I]llegalStateException() ; 
} 


list = list.next; 


ist.data = x; 
eturn; 
void get(Node list, int i) { 


/ returns the value of element number i; 
ode p = list; 

nt j = 0; 

hile (Gj < i &&p != null) { 

+475 

p = p.next; 
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if (p == null) { 

throw new java.util .NoSuchElementException() ; 
} 
return p.data; 


i 


void put(Node list, int i, int x) { 
// inserts x as element number i; 
if (list == null) { 
throw new java.util .NoSuchElementException("list is empty"); 
} else if Gi == 0) { 
lJist.next = new Node(list.data, list.next); 
list.data = x; 


} else { 
Node p = list; 
int j = 1; 
while (j < 1 && p != null) { 
+475 
p = p.next; 
} 


if (p == null) { 
String error=String.format("the list has onlt %d elements",j-1); 
throw new java.util .NoSuchElementException(error) ; 

} 

p.next = new Node(x, p.next); 


- 


void swap(Node list, int i, int j) { 

ifaq<ollj<of{ 

throw new IllegalArgumentException() ; 
} else if Ci == j) { 

return; 
} 
Node p=list, q=list; 
for Cint i1i=0; ii<i; ii++) { 

if (p == null) { 

throw new I]llegalStateException() ; 


} 
p = p.next; 
} 
for (int jj=0; jj<j; jit++) f 
if (q == null) { 
throw new I]llegalStateException() ; 
} 
q = q.next; 
} 


int pdata = p.data, qdata = q.data; 
p.data = qdata; 

q.data = pdata; 

return; 


} 


Node merged(Node list1, Node list2) { 
Node list = new Node(0), p = list, pl = listl, p2 = list2; 
while (pl != null && p2 != null) { 
if (pl.data < p2.data) { 
p = p.next = new Node(pl.data) ; 
pl = pl.next; 
} else { 
p = p.next = new Node(p2.data); 
p2 = p2.next; 
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} 

while (pl != null) { 
p = p.next = new Node(pl.data); 
pl = pl.next; 

} 

while (p2 != null) { 
p = p.next = new Node(p2.data); 
p2 = p2.next; 

} 


return list.next; 


void rotateLeft(Node list) { 


Node p = list, q = list; 
while (p != null) { 
p = p.next; 
if (p != null) f{ 
p = p.next; 
} 
q = q.next; 
} 
// now q = middle node: 
Node m = q; 
p = list; 
Node t=p.next, tt=m.next; 
while (m.next != null) { 
tt = m.next; 
p.next = m; 
p = m.next = t; 
t = p.next; 
m= tt; 
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CHAPTER 4 


The Java Collections Framework 


The Java Collections Framework (JCF) is a group of classes and interfaces in the java.util 
package. Its main purpose is to provide a unified framework for implementing common data 
structures so that the resulting classes can be used in a consistent, efficient, and intuitive manner. 
This chapter outlines how these types can be used. 


THE INHERITANCE HIERARCHY 


A collection is an object that contains other objects, which are called the e/ements of the 
collection. The JCF specifies four general types of collections: List, Queue, Set, and Map. 
These are defined as interfaces and are extended and implemented with other interfaces and 
classes in the java.util package. 

The relationships among the main classes and interfaces in the JCF are summarized in Figure 
4.1. The interfaces are shown on the right in italics, and the classes that implement them are on 
their left. The dashed lines indicate direct implementation. Inheritance, both between classes and 
between interfaces is shown by solid lines. For example, the AbstractList class extends the 
AbstractCollection class, and the List interface extends the Collection interface. Note 
that some of the class names have the prefix Abstract, which indicates that they include 
abstract methods that are implemented by their subclasses. 

The four main types in the JCF are List, Queue, Set, and Map. A Jist is a sequence of 
elements. A queue is a first-in-first-out collection, like a waiting line. A sef is an unstructured 
collection of distinct elements. A map is a collection of component pairs that works like an index 
or dictionary, where the first component is used to “look up” the information stored in the second 
component. 

The hierarchy shows that the Deque, SortedSet, and SortedMap interfaces extend the Queue, 
Set, and Map interfaces, respectively. These are more specialized types. A Deque is a double- 
ended queue, allowing insertions and deletions at both ends. The SortedSet and SortedMap 
interfaces work only with objects that can be compared, like strings. 

The JCF implements the interfaces with several different data structures. The simplest struc- 
ture is an indexed structure, that is, an array, which is used by the ArrayList, the ArrayDeque, 
the HashSet, and the HashMap classes. Other implementations use a linked structure. The 
LinkedList class uses a doubly linked linear structure, the PriorityQueue class uses a heap 
tree, the TreeSet and TreeMap classes use a linked binary search tree, and the LinkedHashSet, 
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Object 

|t— AbstractCollection- -------------------- Collection 

|— AbstractList- ----------------------- L- List 

t— ArrayList 

t— Vector 

L_ AbstractSequentialList 
Linkedists.<. 25 2226.20.27 22% + Queue 

|— AbstractQueue- ------------------. 

'_ PriorityQueue 

1t— ArrayDeque- ------------------------+---! |_ Deque 

L—AbstractSet =.s- 2542 -secs6 oss c te be be lL Set 

t— EnumSet 

t— HashSet 

LinkedHashSet | SortedSet 

He ==) en ee NavigableSet 

L_ AbstractMap. -------------------------- Map 


t— EnumMap 
t— HashMap 
Ls. LinkedHashMap SortedMap 


'_ TreeMap- ------------------------------- i NavigableMap 


Figure 4.1 The inheritance hierarchy for the Java Collections Framework 


and LinkedHashMap classes use a hybrid linked array structure. The specialized EnumSet and 
EnumMap classes use a bit string. These twelve implementations are summarized in Table 4.1. 


Data Structure List Queue Set Map 
Indexed ArrayList ArrayDeque HashSet HashMap 
Linked LinkedList PriorityQueue | TreeSet TreeMap 
Indexed with links LinkedHashSet | LinkedHashMap 
Bit string EnumSet EnumMap 


Table 4.1 Data structures used by the concrete classes in the JCF 


The java.util package includes four other container classes that were created prior to the 
introduction of the JCF in Java version 1.2. They are the Vector, Stack, Dictionary, and 
Hashtable classes. Since they are not quite consistent with the JCF, they are now regarded as 
“legacy classes.” The Vector and Stack classes have been superseded by the ArrayList class, 
and the Dictionary and Hashtable classes by the HashMap class. 


THE Collection INTERFACE 


The Collection interface provides a basis for most of the other interfaces and classes in the 
JCF. It specifies 15 methods, as shown in the Javadoc page in Figure 4.2 on page 71. Some of 
these methods are obvious: for example, the clear(), contains(), isEmpty(), and size() 
methods. The equals() and hashCode() methods are inherited from the Object class and are 
meant to be overridden. 

The add() method may work differently for different implementations. For collections that 
prohibit duplicate elements, this method should add the specified object only if it is not already 
an element. For collections that allow duplicate elements, the method should add the specified 
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lik > Collection (Java Platform SE 6) - Mozilla Firefox 
File Edit Yiew History Bookmarks Tools Help 


bs @ http: {fjava.sun.com/javase/6/docs/api/java/util/Collection, html? 


Collection (Java Platform SE 6) G ™ Police Say Sex Offender, 29, Posed a... Se) 


Method Summary 
boolean add(E e) 
Ensures that this collection contains the specified element (optional operation). 


boolean) addAll (Collection<? extends E> c} 
Adds all of the elements in the specified collection to this collection (optional operation). 


void! clear () 
Removes all of the elements from this collection (optional operation). 
boolean|contains (Object o} 
Returns true if this collection contains the specified element. 


boolean|containsALl (Collection<?> ¢) 
Returns true if this collection contains all of the elements in the specified collection. 


boolean|equals (Object ao) 
Compares the specified object with this collection for equality. 
int |hashCode (} 
Returns the hash code value for this collection. 
boolean | isEmpty () 
Returns true if this collection contains no elements. 
Iterator<E>|iterator () 
Returns an iterator over the elements in this collection. 


boolean! remove (Object 0) 
Removes a single instance of the specified element from this collection, if it is present (optional 
operation). 


boolean! removeAll (Collection<?> ¢) 
Removes all of this collection's elements that are also contained in the specified collection (optional 
operation). 


boolean) retainAll (Collection<?> ¢) 
Retains only the elements in this collection that are contained in the specified collection (optional 
operation). 
int] size() 
Returns the number of elements in this collection. 
Object] | toArray () 
Returns an array containing all of the elements in this collection. 


<T> TL]|/toArray(T[] a) 
Returns an array containing all of the elements in this collection; the runtime type of the returned array is 
that of the specified array. 


Figure 4.2 Methods specified by the java.util .Col lection interface 


object, possibly as a duplicate. In either case, the method should return true if and only if it 
changed the collection. 

The “contract” imposed by the interface requires any class that implements the add() method 
to throw an exception if the collection does not contain the specified object after the method has 
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been called. For example, if an implement prohibits nu11 elements, then the method should 
throw a Nul11PointerException if the argument is nu11. 

Note that the addQ) method is specified as an “optional operation.” This means that a class 
can implement the Collection interface without providing a working version of the method. In 
that case, it would implement the method like this: 

boolean add(E o) { 
throw new UnsupportedOperationException() ; 
i 

The addA11() method adds all the elements that are in the specified collection. Note the 

parameter specifier: 
Collection<? extends E> c 
This means that c is a collection whose elements’ type extends the type E. 

The containsA11() method is to the addA11() method as the contains() method is to the 
add() method. In particular, it should return true if invoked immediately after a call to the 
addA11() method with the same Collection argument. 

The iterator() method returns an Iterator object that is bound to its implicit argument. 
(Iterators are outlined on page 77.) 

The Collection interface specifies three optional removal methods. The remove() method 
deletes one instance of its specified object, if it is an element. The removeA11() method deletes 
all the elements that are equal to elements in its specified collection. The retainAl1() method 
deletes all the elements that are not equal to elements in its specified collection. 

Note that the addA11(Q), the retainAl1Q), and the removeAl11() methods are equivalent to 
the set-theoretic union, intersection, and complement operators, respectively. These are illus- 
trated in Figure 4.3. 


a b a b a b 


a.addA11(b) a.retainAl1(b) a.removeA11(b) 


Figure 4.3 Union, intersection, and complement operations 


The toArray() method returns an array of references to all the elements in the collection. The 
array returned by the no-argument version has type Object[]. The array returned by the one- 
argument version has the same array type as the argument. 

The Collection interface can be used directly as a specification for a Bag class. A bag is an 
unordered collection that allows duplicate elements. 


THE HashSet CLASS 


The HashSet class is probably the simplest of all the concrete classes in the JCF. Its instances 
represent ordinary sets of unique elements (no duplicates). In fact, a set can be defined simply as 
a collection of unique elements. It extends the AbstractSet class. That class implements the 
Set interface, which extends the Collection interface. (See Figure 4.1 on page 70.) 
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The Set interface specifies the same 15 methods that the Collection interface specifies. (See 
Figure 4.2 on page 71.) The only difference is that the “contract” further stipulates that imple- 
menting classes are obliged to ensure that duplicate elements cannot be added to the collection. 

The AbstractSet class implements the equals() method, the hashCode() method, and the 
removeAl1() method, designating the other 12 methods as abstract. This means that the 
concrete EnumSet, HashSet, and TreeSet classes have their own specialized implementations 
of those 12 methods. 

The HashSet class gets its name from the fact that collections of this type maintain no order 
among their elements; they are all mixed up, like hash. 


EXAMPLE 4.1 Testing the HashSet Class 


This example tests 10 of the 15 methods of the HashSet class. 


1 public class TestHashSet { 
2 public static void main(String[] args) { 
3 Set<String> set = new HashSet<String>Q ; 
4 System.out.printf(C"set.isEmptyQ): %b%n", set.isEmpty()); 
5 Collections.addAll(set, "CN", "FR", "GB", "RU", "US"); 
6 System.out.printIn(set) ; 
7 System.out.printf("set.sizeQ): %d%n", set.size(Q)); 
8 System.out.printf("set.contains(\"GB\"): %b%n", set.contains("GB")); 
9 System.out.printfC"set.contains(\"JP\"): %b%n", set.contains("JP")); 
10 System.out.printf("set.isEmpty(): %b%n", set.isEmptyQ()); 
11 set.add("BR"); 
12 System.out.printIn(set) ; 
13 set.remove("FR"); 
14 System.out.printIn(set) ; 
15 String[] array = set.toArray(new String[0]); 
16 for (String string : array) { 
17 System.out.printf("%s ", string); 
18 } 
19 System.out.printin(""); 
20 Set<String> g8 = new HashSet<String>(); 
21 Collections.addAl1(g8, "CA", "DE", "FR", "GB", "IT", "JP", "RU", "US"); 
22 System.out.printIn(g8) ; 
23 g8.retainAl1 (set) ; 
24 System.out.printIn(g8) ; 
25 set.removeAl1(g8) ; 
26 System.out.printIn(set) ; 
27 set.addAl11(g8) ; 
28 System.out.printIn(set) ; 
29 set.clear(); 
30 System.out.printIn(set) ; 
31 } 
32 } 
The output is: 


set.isEmpty(): true 

[FR, US, RU, GB, CN] 
set.sizeQ): 5 
set.contains("GB"): true 
set.contains("JP"): false 
set.isEmpty(): false 

[FR, US, RU, BR, GB, CN] 
[US, RU, BR, GB, CN] 

US RU BR GB CN 
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[JP, FR, US, RU, GB, DE, IT, CA] 


[US, RU, GB] 

[BR, CN] 

[US, RU, BR, GB, CN] 
C] 


The set object is instantiated at line 3 as a HashSet of String elements. The isEmpty() method is 
tested at line 4 and again at line 10. 

At line 5, the collection is loaded with five elements. This is done by the static addA11Q method 
defined in the java.util.Collections class. Note that the order with which the five elements are 
printed at line 6 is different from the order with which they were added to the set. This illustrates the 
“hash” nature of the set. 

The sizeQ and contains() methods are tested at lines 7-9. The add() method is tested at line 11, 
and the remove() method is tested at line 13. 

At line 15, the one-argument toArray() method is used to generate a String array of the same 
elements that are in the collection. The for loop at lines 16-18 confirms that the array contains the same 
elements. 

Lines 20-21 create and load a second set, named g8. This is then used together with the set collection 
to test the retainAl11(Q, removeA11(), and addA11() methods at lines 23-28. 

Finally, the clear () method is tested at line 29. 


GENERIC COLLECTIONS 


The <String> symbol used twice in line 3 of Example 4.1 means that the elements of the 
collection must be String type: 

3 Set<String> set = new HashSet<String>Q ; 
The types Set<String> and HashSet<String> are called parameterized types or generic types. 
Specifying the element type this way causes the compiler to prevent any non-String objects 
from being inserted into the collection, making it type-safe. 


Note that the parametrized type HashSet<String> extends the parametrized type Set<String> 
because the type HashSet extends the type Set. But HashSet<String> is not an extension of 
HashSet<Object> or of Set<Object>. 


The JCF uses type parameters in most of its classes and interfaces. For example, the complete 
Queue<E> interface is shown in Example 4.2. 


EXAMPLE 4.2 The Queue Interface 


public interface Queue<E> extends Collection<E> { 
boolean offer(E 0); 
E pollQ; 
E remove(); 
E peek(); 
E elementQ); 
} 
The expression Queue<E> extends Collection<E> at line | means that Queue<E> is a specialized 
collection of elements of some type E, to be specified later. Whatever type that is, it must also be the type 
for the parameter o in the of fer() method at line 2 and the return type for the other four methods speci- 
fied at lines 3-6. 
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The variable E used in lines 1-6 of Example 4.2 is called a type parameter. Like a method 
parameter, it stands for a type that can be substituted for it. The actual type that is substituted for 
the type parameter is called a type argument. For example: 

Queue<String> stringQueue = new Queue<String>Q( ; 
Here, the collection stringQueue is instantiated by substituting the type argument String for 
the type parameter E. 


EXAMPLE 4.3 Using Type Arguments 


This program uses two special user-defined types at line 3: an enum Month type defined at line 11 and 
a generic Pair type defined at lines 13-32 


1 public class TestPairClass { 

2 public static void main(String[] args) { 

3 Pair<Month, Integer> christmas = new Pair<Month,Integer>(Month.DEC, 25); 
4 System.out.printInCchristmas) ; 

5 Month month = christmas.getFirstQ; 

6 int day = christmas.getSecond(); 

7 System.out.printfC"%d %s%n", day, month); 

8 } 

9 } 


11 enum Month { JAN, FEB, MAR, APR, MAY, JUN, JUL, AUG, SEP, OCT, NOV, DEC } 


13 class Pair<S, T> { 
14 private S first; 
15 private T second; 
16 
17 public Pair(S first, T second) { 
18 this. first = first; 
19 this.second = second; 
20 } 
21 
22 public S getFirstQ { 
23 return first; 
24 } 
25 
26 public T getSecond() { 
27 return second; 
28 } 
29 
30 public String toString(@ { 
31 return "(" + first + ", " + second + ")"; 
32 } 
33 } 
The output is: 
(DEC, 25) 
25 DEC 


The generic class is defined at line 13. It has two type parameters: S and T. In the code at lines 14, 15, 
17, 22, and 26, these two type parameters are used as place holders for actual types. 

Note that the <S,T> expression is not used in the constructor definition at line 18. But the <> construct 
is required when the constructor is invoked, as at line 3. 
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If you compile a program using nongeneric JCF classes (i.e., without specifying a type 
arguments for the element type) you’re likely to get a compiler message like this: 
uses unchecked or unsafe operations. 
Note: Recompile with -Xlint:unchecked for details. 
You can avoid that simply by specifying an element type; even Object itself, like this: 
List<Object> list = new ArrayList<Object>Q; 


GENERIC METHODS 


In addition to generic types, type parameters can also be used to define generic methods, 
identified by the generic parameter specifier <T> placed in front of the return type. 


EXAMPLE 4.4 A Generic Method 


public class TestPrint { 


1 

2 public static void main(String[] args) { 
3 args = new String[]{"CA", "US", "MX", "HN", "GT"}; 
4 print(args); 

5 } 

6 

7 static <E> void print(E[] a) f{ 

8 for (E ae: a) { 

9 System.out.printf("%s ", ae); 

10 } 

1 System.out.printInQ); 

12 } 

13 } 


The output is: 
CA US MX HN GT 
The method is identified as generic by the <E> specifier at line 7. This allows the type parameter E to be 
used in place of an actual type in the method block at line 8. 


GENERIC WILDCARDS 


The symbol ? can be used as a wildcard, in place of a generic variable. It stands for “unknown 
type,” and is called the wildcard type. 


EXAMPLE 4.5 A Universal printQ) Method 


static void print(Collection<?> c) { 
for (Object o : c) { 
System.out.printfC"%s 


1 
2 
3 , 0) 
4 } 
5 System.out.printinQ; 

6 i 

This method can be used to print any type of collection, for example, a HashSet<String> or a 
Queue<Date>. 

Note that if we used Collection<Object> instead of Collection<?> at line 1, then the method 
would only apply to collections whose element type is specified as Object. For example, it could not be 
used to print a HashSet<String> because that is not an extension of Collection<Object>. 
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The wildcard type can be used more selectively to limit the range of types that can be used in 
its place. For example, if you wanted to restrict the print() method in Example 4.5 to exten- 
sions of a Person class, then you would replace line 1 with: 

1 static void print(Collection<? extends Person> c) { 
The expression ? extends Person is called a bounded wildcard type. 


ITERATORS 


An iterator is an object that provides access to the elements of a Collection object. It acts 
like a finger, pointing to one element at a time, much like the cursor of a text editor points to one 
character at a time. And like a cursor, an iterator has the capability of moving from one element 
to the next and of deleting its current element. 

The analogy between iterators and cursors is not complete. Most iterators cannot jump around 
the way cursors can. On the other hand, it is possible to have several independent iterators 
traversing the same collection. 

The algorithm that determines each “next” element in the iterator’s path is an intrinsic 
property of the iterator itself. Moreover, it is possible to have several different iterator classes 
defined for the same collection class, each with its own traversal algorithms. 

The requirements for an iterator are specified in the java.util.Iterator interface, shown 
in Figure 4.4. 


File Edit View History Bookmarks Tools Help 
Gt | “uf sun] http://java.sun.comfjavase/6/docs/api/javajutil/Iterator.html | * | De IGl- 


Bf Iterator (Java Platform SE 6) (3) Police Say Sex Offender, 29, Posed a... 


Method Summary 


boolean |/hasNext () 


Returns true if the iteration has more elements. 


Elnext () 
Returns the next element in the iteration. 


void! remove ()} 


Removes from the underlying collection the last element returned by the iterator (optional operation). 


Figure 4.4 Methods specified by the java.util . Iterator interface 


The next() method returns the current element of the collection to which the iterator is 
bound. Each time the next() method is invoked, it advances the iterator to the next element in 
the collection. 

The hasNext() method returns true unless the iterator has reached the end of its traversal. 
This predicate is used to determine whether the next() method can be called again. 

The remove() method deletes the last element returned by the next() method. This means 
that next) must be called before each call to remove(), designating the element to be deleted. 

Normally, iterators are obtained by means of a call to the collection’s iterator() method 
that is required by the Collection interface. (See Figure 4.2 on page 71.) It returns a new itera- 
tor, bound to the collection and initialized to its first element. 
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EXAMPLE 4.6 Using an Iterator 


1 public class TestIterators { 
2 public static void main(String[] args) { 

3 Set<String> port = new HashSet<String>(); 

4 Collections.addAll(port, "AO", "BR", "CV", "GW", "MO", "MZ", "PT"); 
5 System.out.printIn(port) ; 

6 Iterator itl = port.iterator(); 

7 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 

8 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 

9 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 


10 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 
"1 itl.remove(); 

12 System.out.printIn(port) ; 

13 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 
14 itl.remove(); 

15 System.out.printIn(port) ; 

16 Iterator it2 = port.iterator(); 

17 while(it2.hasNextQ)) { 

18 System.out.printf("%s ", it2.nextQ); 

19 } 

20 System.out.printin(""); 

21 System.out.printfC"itl.nextQ: %s%n", itl.nextQ); 
22 } 

23 } 


The output is: 
[MZ, BR, PT, MO, GW, CV, AO] 
itl.nextQ): MZ 
itl.nextQ): BR 
jtl.nextQ: PT 
jtl.nextQ: MO 
[MZ, BR, PT, GW, CV, AO] 
itl.nextQ: GW 
[MZ, BR, PT, CV, AO] 
MZ BR PT CV AO 
itl.nextQ: CV 
The Iterator object it1 is instantiated at line 6 and returned by the iterator() method. Bound to 
the port collection, it visits the elements in the same order as they were printed at line 5. Note that this 
order is not the same as the order in which the elements were added to the collection. Since the collection 
is a set, its elements have no intrinsic order. It is the iterator itself that computes the traversal sequence. 
You can see from the output from line 5 that the toString() method (invoked implicitly by the 
printInQ method) uses the same traversal sequence, obviously using its own iterator to build the string. 
The it1 iterator’s remove() method is invoked at lines 11 and 14. This causes it to delete the “current” 
element, which in each case is the last element returned by a call to next(). At line 11, the last nextQ 
call was at line 10, which returned the MO element; so MO is deleted by remove() at line 11. Similarly, at 
line 14, the last next © call was at line 13, which returned the GW element; so GW is deleted by remove() 
at line 14. These deletions are evident from the output produced by the while loop at lines 17—19. Notice 
that that loop uses the independent iterator it2 to traverse the collection. 
Note that the code at lines 16-19 make a complete traversal of the collection. This is guaranteed by the 
instantiation of the new iterator at line 16 and the use of the hasNext () method to control the loop. 
The last output, generated at line 21, confirms that the two iterators it1 and it2 are independent. The 
action of it2 had no effect on it1, which returns the CV element at line 21. 
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The iterator it2 in Example 4.6 used a whi le loop to traverse the collection. Iterators can be 
used in a for loop the same way: 
for (Iterator it = countries.iterator(); it.hasNextQ); ) f{ 
System.out.printf("%s ", it.nextQ); 
} 
This is analogous to using a for loop to traverse an array: 
for Cint i = 0; i < countries.length; i++) { 
System.out.printf("%s ", countries[i]); 
} 
Java 5.0 introduced the for-each construct for simplifying for loops: 
for (String country : countries) { 
System.out.printf("%s ", country); 
} 
Here, the String variable country takes the place of the indexed expression countries [i]. 
This same code also replaces the iterator loop! Thus, the program in Example 4.6 works the same 
way if we replace lines 16—19 with this simpler for-each loop: 
for (String country : port) { 
System.out.printf("%s ", country); 
} 4 
This code implicitly invokes the port collection’s 
iterator( method, generating an implicit itera- én _ 
tor, which then implicitly invokes the hasNext () y Sa 
and next() methods within the loop to traverse 


ws 
the collection. The declared variable country (A 
takes the place of the expression it.next() 
inside the loop. 
Figure 4.5 illustrates the iterator it2 that 
traverses the port collection in Example 4.6. The Us 


essential feature is that the iterator locates one 
element at a time in the collection. It is up to the 
collection’s iterator() method to determine the 
algorithm by which its iterator traverses the 
collection. 


Figure 4.5 The iterator it2 on the set port 


THE TreeSet CLASS 


In addition to the Set interface, the TreeSet class also implements 6 methods specified by the 
SortedSet interface (Figure 4.6) and 13 methods specified by its NavigableSet extension 
(Figure 4.7). These additional 19 methods require implementing classes to maintain an ordering 
mechanism among their elements, allowing them to be compared for size. 

In Java, there are two ways for objects to be compared: either by means of their natural order- 
ing or by the application of an external Comparator object. Classes whose objects enjoy a 
natural order implement the comparable interface. These include the wrapper classes (Integer, 
Double, etc.), the BigInteger and BigDecimal classes in the java.math package, and the 
String class. 

User-defined orderings can be defined for a class by implementing the Comparator interface, 
which is part of the JCF. It specifies a compare() method, which returns an int that indicates 
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Method Summary 


Comparator<? | comparator () 
super. E> Returns the comparator used to order the elements in this set, or nul1 if this set uses the natural 
ordering of its elements. 


first () 
Returns the first (owest) element currently in this set. 


Sortedset<E>|/headSet (E toElement) 
Returns a view of the portion of this set whose elements are strictly less than toElement. 


last () 
Returns the last (highest) element currently in this set. 
Sortedset<E>|/subSet (E fromElement, E toElement) 
Returns a view of the portion of this set whose elements range from fromE lement, inclusive, to 
toElement, exclusive. 


Sortedset<E>|/tailSet (E fromElement) 
Returns a view of the portion of this set whose elements are greater than or equal to fromE lement. 


|v 


Figure 4.6 Specialized methods specified by the java.util .SortedSet interface 


how its two arguments are ordered: compare(x,y) > 0 means that x is greater than y, 
compare(x,y) <0 means that x is less than y, and compare(x,y) == 0 means that x equals y. 
A Comparator object can be passed to a SortedSet constructor to specify how the elements 
should be ordered. 


EXAMPLE 4.7 Constructing a TreeSet Collection with a Comparator Object 


1 public class TestTreeSetWithComparator { 

2 public static void main(String[] args) { 

3 SortedSet<String> ital = new TreeSet<String>Cnew RevStringComparator()); 
4 Collections.addAl]lCital, "IT", "VA", "SM", "CH"); 

5 System.out.printInCital); 

6 } 

7 } 

8 

9 class RevStringComparator implements Comparator<String> { 
10 public int compare(String sl, String s2) { 

11 StringBuilder sb1 = new StringBuilder(s1); 

12 StringBuilder sb2 = new StringBuilder(s2) ; 

13 String slrev = sbl.reverse().toStringQ ; 

14 String s2rev = sb2.reverse().toStringQ; 

15 return slrev.compareTo(s2rev) ; 

16 } 

17 } 


The output is: 
[VA, CH, SM, IT] 
The separate RevStringComparator class implements the Comparator interface for String objects. 
It defines its compare() method by applying the String class’s CompareTo() method to the reversed 
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Method Summary 


ceiling(E e) 
Returns the least element in this set greater than or equal to the given element, or nul if there is 
no such element. 


Iterator<E> | descendingIterator (} 
Returns an iterator over the elements in this set, in descending order. 
NavigableSet<E> descendingSet (} 
Returns a reverse order view of the elements contained in this set. 


floor(E e) 
Returns the greatest element in this set less than or equal to the given element, or nul if there is 
no such element. 
Sortedset<E>|/headSet (E toElement) 
Returns a view of the portion of this set whose elements are strictly less than toElement. 


Navigableset<E>|/headSet (E toElement, boolean inclusive} 
Returns a view of the portion of this set whose elements are less than (or equal to, if inclusive 
is true) toElement. 


higher (E e) 
Returns the least element in this set strictly greater than the given element, or null if there 1s no 
such element. 


Iterator<E> | iterator () 
Returns an iterator over the elements in this set, in ascending order. 


lower (E e) 
Returns the greatest element in this set strictly less than the given element, or null if there 1s no 
such element. 


pollFirst () 
Retrieves and removes the first dowest) element, or returns null if this set is empty. 
poliLast () 
Retrieves and removes the last (highest) element, or returns null if this set is empty. 


Navigableset<E>|subSet (E fromElement, boolean fromInclusive, E toElement, boolean toInclusive) 
Returns a view of the portion of this set whose elements range from fromElement to 
toElement. 


sortedset<E>|subSet (E fromElement, E toElement) 
Returns a view of the portion of this set whose elements range from fromE lement, inclusive, to 
toElement, exclusive. 


Sortedset<E>|tailSet (E fromElement) 
Returns a view of the portion of this set whose elements are greater than or equal to 
fromElement. 


Navigableset<E>|/tailSet (E fromElement, boolean inclusive) 
Returns a view of the portion of this set whose elements are greater than (or equal to, if 
inclusive is true) fromElement. 


Figure 4.7 Specialized methods specified by the java.util .Navigab] eSet interface 
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strings. As a result, the SortedSet collection orders its elements by applying alphabetical ordering to 
their reversed strings. Since AV < HC < MS < TI (alphabetically), the order of the four inserted elements is 
as shown in the output from line 5. 


If the TreeSet collection is instantiated without an explicit Comparator object, then it uses 
its elements’ natural ordering. 


EXAMPLE 4.8 Testing the TreeSet Class 


1 public class TestTreeSet { 

2 public static void main(String[] args) { 

3 NavigableSet<String> engl = new TreeSet<String>() ; 

4 Collections.addAllCengl, "IN", "US", "PK", "NG", "PH", "GB", "ZA"); 
5 System.out.printIn(Cengl); 

6 engl.add("KE") ; 

7 System.out.printIn(Cengl); 

8 SortedSet<String> head = engl.headSet("KE") ; 

9 SortedSet<String> mid = engl.subSetC("KE", "US"); 

10 SortedSet<String> tail = engl]l.tailSet("US"); 

1 System.out.printf("%s %s %s%n", head, mid, tail); 

12 System.out.printfC"engl.firstQ): %s%n", engl.firstQ); 
13 System.out.printfC"engl.lastQ: %s%n", engl.lastQ); 
14 } 

15 } 


The output is: 
[GB, IN, NG, PH, PK, US, ZA] 
[GB, IN, KE, NG, PH, PK, US, ZA] 
[GB, IN] [KE, NG, PH, PK] [US, ZA] 
engl.firstQ: GB 
engl.lastQ: ZA 
eng]. lower("KE"): IN 
eng]. higherC"KE"): NG 

The set eng] is instantiated as a TreeSet of String elements at line 3 and loaded with seven elements 
at line 4. An eighth element is added at line 6. The outputs from lines 5 and 7 confirm that the TreeSet is 
maintaining its elements in their natural (alphabetical) order. 

Lines 8—13 illustrate some of the specialized methods implemented by the TreeSet class. (See Figure 
4.6 on page 80.) The headSet() method returns the sorted subset of all elements that precede its 
argument. The subSet() method returns the sorted subset of elements that begin with its first argument 
and precede its second argument. The tailSet() method returns the sorted subset of all elements that do 
not precede its argument. Note that these three methods adhere to the Java’s “left-continuous” policy that 
whenever a linear segment is to be specified by two bounding points a and b, the segment will include the 
lower element a and exclude the upper element b. For example, the subset [KE, NG, PH, PK] returned 
by the call eng].subSet("KE", "US") at line 9 includes the lower element KE and excludes the upper 
element US. 


The intrinsic difference between the HashSet and TreeSet classes is their backing data struc- 
ture. The HashSet class uses a hash table (outlined in Chapter 8), which uses each element’s 
hashCode() to compute its location in the set. The TreeSet class uses a balanced binary search 
tree (outlined in Chapter 12) to store its elements. The advantages and disadvantages of these 
two data structures are summarized in Table 4.2. 

These relative advantages and disadvantages make the choice easy: If you want to preserve the 
natural order of the elements in your set, use the TreeSet class; otherwise, use the HashSet 
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Data structure Advantage Disadvantage 
Hash table Access time is independent of the size n of the collection | Elements are stored in random order. 
Search tree Elements are accessible in their natural order Access time is proportional to logn 


Table 4.2 Hash tables versus search trees 


class. In practice, the slower access time of the TreeSet class is usually not noticeable, unless 
the set is really large. For example, if » < 1000, there’s really no disadvantage in using the 
TreeSet class. 


THE LinkedHashSet CLASS 


The LinkedHashSet class is the same as the HashSet class, except that it maintains a doubly 
linked list of the elements in the order in which they are inserted. This overcomes the random 
ordering of the HashSet class. The only disadvantage is slightly slower insertion and deletion 
times, which in most cases would be unnoticeable. 


EXAMPLE 4.9 Testing the LinkedHashSet Class 


public class TestLinkedHashSet { 
public static void main(String[] args) { 
Set<String> ital = new LinkedHashSet<String>Q ; 
Collections.addAl]lCital, "IT", "VA", "SM", "CH"); 
System.out.printInCital); 
ital.removeC"VA"); 
System.out.printInCital); 
jital.add("VA"); 
System.out.printInCital); 
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} 
} 

The output is: 

[IT, VA, SM, CH] 

[IT, SM, CH] 

[IT, SM, CH, VA] 
This program creates the same ital set as in Example 4.8 on page 82. The output from line 5 confirms 
that its access order matches the order in which the elements were inserted. The output from line 9 shows 
that the add() method inserts at the end of the linked list. 
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THE EnumSet CLASS 


The EnumSet class is designed for elements of an enum type (enumerated type). For example, 
enum Day { SUN, MON, TUE, WED, THU, FRI, SAT } 
If an application uses sets of days, then those sets can be implemented most efficiently as 
EnumSet objects using an enum type like this. 

In addition to the 15 methods required by the Set interface, the EnumSet class also imple- 
ments the 13 additional methods specified in its Javadoc page, shown in Figure 4.8. Notice that 
each of these methods returns an EnumSet object, and every one except clone() is a static 
method. These are used to instantiate the sets, like this: 

EnumSet<Day> weekdays = EnumSet.range(Day.MON, Day.FRI); 
EnumSet<Day> weekend = EnumSet.of(Day.SUN, Day.SAT); 
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Method Summary 


static) 5110£ (Class<E> elementType) 
<E extends EnumE>> |~____— én ‘ ‘ 
EnunSetcE> Creates an enum set containing all of the elements in the specified element type. 


static | complement Of (EnumSet<E> s) 
a Cr eons Creates an enum set with the same element type as the specified enum set, initially containing 
all the elements of this type that are xof contained in the specified set. 
static 


extends Enum<E>> 
Enumset<E> 


copyOf (Collection<E> cj 
Creates an enum set initialized from the specified collection. 


static | copy0f (EnumSet<E> s) 

asi ae te Creates an enum set with the same element type as the specified enum set, intially containing 
the same elements (if any). 

static 


extends Enum<E>> 
Enumset<E> 


noneOf (Class<E> element Type) 
Creates an empty enum set with the specified element type. 
static 


extends Enum<E>> 
Enumset<E> 


e) 
Creates an enum set initially contaming the specified element. 


static 


first, E... rest) 
extends Enum<E>> C initiall sane th fed el 
EnumSet<E> reates an enum set imtially containing the specified elements. 


static el, E e2} 


extends Enum<E>> C initial ing th ified el 
Enum$et<E> reates an enum set initially containing the specified elements. 


static e1, E eZ, E 23) 
extends Enum<E>> ae in i 
EnumSet<E> Creates an enum set initially containing the specified elernents. 


static el, E e2, E e3, E e4) 
extends Enum<E>> |—— ‘e initial! ining th ified el 
EnumSet<E> reates an enum set initially containing the specified elements. 


staticlor(e ei, E ez, E e3, E e4, E eS} 


extends Enum<E>> C initiall ‘nine th fied el 
EnumSet<E> reates an enum set intially contamng the specihed elements. 


static|range(E from, E to) 
cee fr Creates an enum set initially containing all of the elements in the range defined by the two 
specified endpoints. 


Figure 4.8 Specialized methods implemented by the java.util .EnumSet class 


EXAMPLE 4.10 Testing the EnumSet Class 


1 public class TestEnumSet { 


public static enum Month { JAN, FEB, MAR, APR, MAY, JUN, 
JUL, AUG, SEP, OCT, NOV, DEC } 


a fF wo ND 
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6 public static void main(String[] args) { 
7 EnumSet<Month> spring = EnumSet.range(Month.MAR, Month. JUN); 
8 System.out.printIn(spring) ; 
9 System.out.printInCEnumSet.complementOf (spring) ) ; 
10 EnumSet<Month> shortMonths = 
11 EnumSet.of(Month.SEP, Month.APR, Month.JUN, Month.NOV, Month. FEB); 
12 System.out.printInCshortMonths) ; 
13 shortMonths.addAl1(spring) ; 
14 System.out.printInCshortMonths) ; 
15 } 
16 } 
The output is: 


[MAR, APR, MAY, JUN] 

[JAN, FEB, JUL, AUG, SEP, OCT, NOV, DEC] 
[FEB, APR, JUN, SEP, NOV] 

[FEB, MAR, APR, MAY, JUN, SEP, NOV] 

The enum type Month is defined at line 3. Variables of that type can have only those 12 specified values 
(or null). At line 7, the EnumSet spring is instantiated by invoking the static range() method of the 
EnumSet class. Notice that this method does not adhere to Java’s “left-continuous” policy (see page 82): 
the resulting set includes both of the specified elements MAR and JUN. 

The EnumSet class’s complementOf() and of () methods are invoked at lines 9 and 11, respectively. 
These work as described in the Javadoc, shown in Figure 4.8 on page 84. The addA11() method is tested 
at line 13, producing the union of the shortMonths and spring sets. 


THE List INTERFACE 


A list is a linearly ordered data structure in which every element (except possibly the first) has 
a predecessor element and every element (except possibly the last) has a successor element. List 
elements are usually numbered in sequence: x9, x;, X, . . . The numbers are called indexes or 
subscripts. Unlike sets, list data structures usually allow duplicate elements. 

The List interface extends the Collection interface. (See Figure 4.1 on page 70.) In 
addition to the 15 methods specified by the Collection interface (Figure 4.2 on page 71), the 
List interface also specifies these 10 additional methods: 

voidadd(int index, E element) 
boolean addAl1Cint index, Collection<? extends E> c) 
E getCint index) 
int indexOf (Object o) 
int lastIndexOf(Object o) 
ListIterator<E> listIterator( 
ListIterator<E> listIterator(int index) 
E remove(int index) 
E set(Cint index, E element) 
List<E> subList(Cint fromIndex, int toIndex) 
These methods use indexed access into the list. 
The indexed addA11() method has a second parameter with this generic type: 
Collection<? extends E> 
The ? symbol, called a generic wildcard, means that the type is unknown. The expression 
? extends E means the unknown type must be an extension of the collection’s element type E. 
The ListIterator<E> type is outlined on page 87. 
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THE ArrayList AND Vector CLASSES 


The ArrayList class uses an array to implement the List interface. When the array becomes 
full, the add() method resizes it by replacing it with one that is twice as big. That is time- 
consuming, but it happens infrequently. 

The array index allows the get() and set() methods to run in constant time, independent of 
the size of the collection. Also, the no-argument add() method runs in amortized constant time, 
which means that the time it takes to insert n elements is (on average) proportional to n. It does 
so by appending the new elements to the end of the list. 

The indexed add() and remove() methods have to shift subsequences of elements back and 
forth in the array to accommodate the inserted elements and to fill the gaps left by the deleted 
elements. Consequently, these operations run in /inear time, which means that the time is (on 
average) proportional to the size of the collection. 

The Vector class is similar to the ArrayList class, using a resizable array to store its 
elements. It has 45 methods, including the 3 that it inherits from the AbstractList class. In 
addition to those 3, the 15 methods specified by the Collection interface (Figure 4.2 on page 
71), and the 10 methods specified by the List interface, it implements these 17 Vector 
methods: 

void addElement(E obj) 

int capacity() 

void copyInto(Object[] anArray) 

E elementAtCint index) 

Enumeration<E> elements() 

void ensureCapacityCint minCapacity) 

E firstElement () 

int indexOf(Object o, int index) 

void insertElementAtCE obj, int index) 

E lastElement() 

int lastIndexOf (Object 0, int index) 

void removeAl1Elements() 

boolean removeElement (Object obj) 

void removeElementAtCint index) 

void setElementAt(E obj, int index) 

void setSize(int newSize) 

void trimToSize(Q) 
These date back to Java 1.0, which preceded the JCF. Most of them are redundant. For example, 
the removeElement (Object) method is the same as the remove(Object) method specified by 
the Collection interface, and the removeElementAt(int) method is the same as the 
remove(int) method specified by the List interface. 


THE LinkedList CLASS 


The LinkedList class uses a doubly linked list to implement the List interface. In addition 
to the 25 methods specified by the Collection and List interfaces, this class also implements 
these 11 other methods: 

void addFirst(E o) 
void addLast(E 0) 
E element() 
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E getFirstQ 
E getLastQ 
boolean offer(E 0) 
E peekQ) 
pollQ 
remove() 
removeFirst() 

E removeLast() 

The add, get, and remove methods that refer to first and last access the first element and 
last element of the list, respectively. For example, the call 

list.addFirst(x); 
would add the object x to the front of the list, making it the first element. 

Note that the three remove methods listed here are all obviated by the remove(int) method 
specified by the List interface. The LinkedList methods remove() and removeFirst() are 
the same as the List method remove(0). 

The other five new methods (element(), offerQ, peekQ, poll(), and remove()) are 
outlined below. 


mmm 


THE ListIterator INTERFACE 


The ListIterator interface extends the Iterator interface by specifying the six additional 
methods shown in Figure 4.9. These extra methods require ListIterator objects to be bidirec- 
tional. Thus, previous(), hasPrevious(), and previousIndex() act the same way as 
next(), hasNext(),and nextIndex(), respectively, except in the reverse direction. 

The ListIterator interface also adds two more “optional requirements”: the add() and 
set() methods, to accompany the optional remove() method. Recall (page 72) that the purpose 
of specifying an “optional” method in an interface is to recommend the method signature for that 
method if it is implemented. To avoid implementing the method, the class can simply throw a 
new UnsupportedOperationException object, as shown on page 72. 


THE Queue INTERFACE 


A queue is a waiting line. In computing, a queue is a linear data structure that allows insertions 
only at one end and deletions only at the other end. As a dynamic data structure, this embodies 
the first-in-first-out (FIFO) protocol. The most familiar use of the queue data structure is a print 
queue, which temporarily holds print jobs for a printer. 

The Queue interface was added to the JCF with Java 5.0. Reflecting the print queue prototype, 
the Javadoc states that a queue is a “collection designed for holding elements prior to process- 
ing.” It specifies five specialized methods in addition to the 15 methods specified by the 
Collection interface, overriding the add() method. (See Figure 4.10 on page 89.) 

The add() and offer() methods insert the specified element at the back of the queue. The 
only difference between them is their behavior when the queue is full: addQ throws an 
unchecked exception, while of fer() returns false. 

The element() and peek() methods return the element at the front of the queue. The only 
difference between them is their behavior when the queue is empty: element() throws a 
NoSuchE]ementException, while peek() returns nul1. 
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Method Summary 
void add(E e) 
Inserts the specified element into the list (optional operation). 
boolean|hasNext () 
Returns true if this list tterator has more elements when traversing the list in the forward direction. : 
boolean |hasPrevious () j 
Returns true if this list tterator has more elements when traversing the list in the reverse direction. 


E/next ()} 

Returns the next element in the lst. 

int |next Index () 
Returns the index of the element that would be returned by a subsequent call to next. 

previous () 

Returns the previous element in the list. 

int | previousIndex () 
Returns the index of the element that would be returned by a subsequent call to previous. 

void! remove ()} 

Removes from the list the last element that was returned by next or previous (optional operation). 


void set(E e) 
ea Replaces the last element returned by next or previous with the specified element (optional operation). 


Figure 4.9 The java.util .ListIterator interface 


The remove() and pol1l() methods delete and return the element at the front of the queue. 
The only difference between them is their behavior when the queue is empty: remove() throws a 
NoSuchElementException, while poll () returns nul11. 

The AbstractQueue class implements 5 of the 20 required methods of the Queue interface: 
add(), addA11(), clear(Q), element(), and remove(). The purpose of this class (as with all 
abstract classes) is to serve as a partial implementation, to be completed in a concrete exten- 
sion by implementing the other required methods. The Javadoc for the AbstractQueue class 
states: “A Queue implementation that extends this class must minimally define a method 
Queue.offer(E) which does not permit insertion of null elements, along with methods 
Queue.peek(Q), Queue.poll(Q), Collection.sizeQ), and a Collection.iteratorQ 
supporting Iterator. remove().” 


EXAMPLE 4.11 Implementing a LinkedQueue Class 


public class TestQueue { 
public static void main(String[] args) { 
Queue<String> queue = new LinkedQueue<String>() ; 
Collections.addAl1l(queue, "AR", "BO", "CO", "EC"); 
System. out.printIln(Cqueue) ; 
String firstOut = queue. remove(); 
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Method Summary 


boolean add(E e) 
Inserts the specified element into this queue if it is possible to do so immediately without violating capacity 
restrictions, returning true upon success and throwing an IllegalStateException ifno space is currently 
available. 


element () 
Retrieves, but does not remove, the head of this queue. 


boolean] offer (E e) 
Tnserts the specified element into this queue if it is possible to do so immediately without violating capacity 
restrictions. 


Figure 4.10 Specialized methods specified by the java.util .Queue interface 


7 System. out.printIlnCqueue) ; 

8 System.out.printfC"Removed %s%n", firstOut); 
9 queue.add("PE"); 

10 System. out.printIln(Cqueue) ; 

1 String secondOut = queue.remove() ; 

12 System. out.printIln(Cqueue) ; 

13 System.out.printfC"Removed %s%n", secondOut) ; 
14 } 

15 } 

16 

17 class LinkedQueue<E> extends AbstractQueue<E> implements Queue<E> { 
18 private List<E> list = new LinkedList<E>(); 

19 

20 public Iterator<E> iterator() { 

21 return list.iteratorQ; 

22 } 

23 

24 public boolean offer(E e) { 

25 if Ce == null) { 

26 return false; 

27 } else { 

28 list.add(e); 

29 return true; 

30 } 


31 } 
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33 public E peekQ) { 

34 return list.get(0); 

35 } 

36 

37 public E pollQ® { 

38 if Clist.isEmptyQO) { 
39 return null; 

40 } else { 

a return list. remove(0); 
42 } 

43 } 

44 

45 public int sizeQ { 

46 return list.sizeQ; 
47 } 

48 } 


The output is: 
[AR, BO, CO, EC] 
[BO, CO, EC] 
Removed AR 
[BO, CO, EC, PE] 
[CO, EC, PE] 
Removed BO 
The LinkedQueue class uses a LinkedList to store its elements at line 18. This is called composition 
of classes; a LinkedQueue object is composed of a LinkedList object. It allows the composing class’s 
methods to be implemented using the component class’s methods. Thus, offer() uses list.addQ) at 
line 28, peekQ uses list.get() at line 34, pol1Q uses list.remove() at line 41, and sizeQ uses 
list.size() at line 46. 
The action of the mainQ@ method illustrates the FIFO nature of a queue: Insert elements at the back and 
remove them from the front. Thus, the first in (AR) was the first out (at line 6), and the second in (BO) was 
the second out (at line 11). 


THE PriorityQueue CLASS 


A priority queue is the same as an ordinary queue except for its removal algorithm: Instead of 
removing the element that has been in the queue the longest, it removes the element that has the 
highest priority. This of course, requires its elements to be prioritized. In other words, the 
elements have to have some ordering mechanism; either a natural order, such as alphabetic order 
for strings, or an order imposed by a Comparator object. (See page 79.) 

The PriorityQueue class extends the AbstractQueue class, implementing the Queue inter- 
face. To accommodate the priority protocol, it includes a constructor for specifying a 
Comparator, a constructor for specifying a SortedSet source of elements, and an accessor 
method for obtaining the elements’ comparator: 

PriorityQueue(int initialCapacity, Comparator<? super E> comparator) 

PriorityQueue(SortedSet<? extends E> c) 

Comparator<? super E> comparator () 
Note that the first of these two constructors requires the collection’s initialCapacity to be 
specified. That is because the PriorityQueue class is implemented with a heap data structure, 
which uses an array to store its elements. (See Chapter 13.) 
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EXAMPLE 4.12 Testing the PriorityQueue Class 


1 public class TestPriorityQueue { 
2 public static void main(String[] args) { 
3 Queue<String> queue = new PriorityQueue<String>() ; 
4 Collections.addAl1l(queue, "CO", "UY", "EC", "AR"); 
5 System.out.printfC"Removed %s%n", queue. remove()); 
6 System.out.printfC"Removed %s%n", queue. remove()); 
7 queue.add("PE") ; 
8 queue.add("BO"); 
9 System.out.printfC"Removed %s%n", queue. remove()); 
10 System.out.printfC"Removed %s%n", queue. remove()); 
1 } 
12 } 
The output is: 
Removed AR 
Removed CO 
Removed BO 
Removed EC 
The queue collection is instantiated at line 3 as a PriorityQueue of Strings. Consequently, the 
remove() method always removes the element that comes first alphabetically. 


THE Deque INTERFACE AND ArrayDeque CLASS 


A deque (pronounced “deck’’) is a double-ended queue, that is, a linear data structure that 
allows insertions and deletions at both ends but nowhere else. 

The Deque interface extends the Queue interface. (See Figure 4.1 on page 70.) In addition to 
the 15 methods specified by the Collection interface and the 5 more methods specified by the 
Queue interface, the Deque interface specifies these 17 methods: 


void addFirst(E e) 

void addLast(E e) 

Iterator<E> descendingIterator() 

E getFirst() 

E getLastQ 

boolean offerFirst(E e) 

boolean offerLast(E e) 

E peekFirstQ 

E peekLast(Q) 

E pollFirstQ 

E pollLastQ 

E pop 

void push(CE e) 

E removeFirst() 

E removeLast() 

boolean removeFirstOccurrence(Object o) 
boolean removeLastOccurrence(Object 0) 


Notice how these methods come in pairs, one each for each end of the deque: first for the front 
of the deque, and last for the back. 

The descendingIterator() method is the same as the iterator() method, except that it 
moves in the opposite direction. 
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The pop() and push() methods are the same as the removeFirst() and addFirst() 
methods, respectively. They are the traditional names for the insert and delete methods for a 
stack data structure, which is a deque that allows access at only one end. (See Chapter 5.) 

The ArrayDeque class implements the Deque interface by using a resizable array to store its 
elements. The Deque interface and its ArrayDeque implementation were added to the JCF with 
Java 6. 


EXAMPLE 4.13 Using the ArrayDeque Class 


1 public class TestArrayDeque { 

2 public static void main(String[] args) { 

3 Deque<String> deque = new ArrayDeque<String>() ; 

4 Collections.addAl1l(deque, "GB", "DE", "FR", "ES"); 

5 System. out.printIn(Cdeque) ; 

6 System.out.printInC"deque.getFirstQ): " + deque.getFirstQ)); 
7 System. out.println(Cdeque) ; 

8 System.out.printInC"deque.removeFirst(): " + deque.removeFirstQ)); 
9 System. out.printIn(Cdeque) ; 

10 System.out.printInC"deque.addFirst(\"IT\"):"); 

Tl deque.addFirst("IT"); 

12 System. out.printIn(deque) ; 

13 System.out.printInC"deque.getLast(): " + deque.getLast(Q)); 
14 System. out.println(deque) ; 

15 System.out.printInC"deque.removeLast(): " + deque.removeLastQ); 
16 System. out. print ln(Cdeque) ; 

17 System.out.printInC"deque.addLast(\"IE\"):"); 

18 deque.addLast("IE"); 

19 System. out.println(Cdeque) ; 

20 } 

21 } 


The output is: 
[GB, DE, FR, ES] 
deque.getFirst(): GB 
[GB, DE, FR, ES] 
deque.removeFirst(): GB 
[DE, FR, ES] 
deque.addFirst("IT"): 
[IT, DE, FR, ES] 
deque.getLast(Q): ES 
[IT, DE, FR, ES] 
deque.removeLast(): ES 
[IT, DE, FR] 
deque.addLast("IE"): 
[IT, DE, FR, IE] 

The program tests the get(), add(Q), and remove() methods from both ends of the deque. 


THE Map INTERFACE AND ITS IMPLEMENTING CLASSES 


A map (also called a dictionary or look-up table) is a collection whose elements are key-value 
pairs. A key is a unique element that acts as an identifier for its value, which is an element that 
typically holds several components of data. 
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The information shown in Table 


: Name ISO Language Area_ Population Age 
4.3 provides a good example. If 


h d dj Austria AT German 83,870 8,192,880 40.9 
ae = a nes i pe Poland PL Polish 312,685 38,536,869 37.0 
ee Eenen S om oe. Aes France FR French 547,030 60,876,136 39.1 
wou. - 
‘ P y Germany DE German 357,021 82,422,299 42.6 
value pair. The keys would be the 
Italy IT Italian 301,230 58,133,509 42.2 


strings in the ISO column. The 
value for each key would be the 
rest of the data for the country that 
is uniquely identified by that key. Table 4.3 A table of seven records 
These data could be the fields of a 
Country object, using the Country class defined in Example 4.14. 

The Map interface is defined like this: 

public interface Map<K,V> 

Its type parameters are the key type K and the value type V. The type Map<String, Country> 
could be used for the records in Table 4.3, using the String class for K and a Country class for 
V. (See page 74.) 

The Map interface is shown in Figure 4.11 on page 94. It specifies 14 methods. Note that this 
interface does not extend the Collection interface. (See Figure 4.1 on page 70.) Sets, lists, and 
queues are collections of individual elements, which are not key-value pairs. 

The two subinterfaces and five implementing classes of the Map interface are analogous to the 
corresponding subinterfaces and implementing classes of the Set interface. Like the 
AbstractSet class, the AbstractMap class implements only those methods that are either 
independent of the underlying data structure or can be implemented using the other methods. 
Like the EnumSet, HashSet, LinkedHashSet, and TreeSet classes, the EnumMap, HashMap, 
LinkedHashMap, and TreeMap implement the abstract methods. 

The EnumMap class is defined like this: 

public class EnumMap<K extends Enum<K>,V> extends AbstractMap<K, V> 
The expression K extends Enum<K> means that only enum types can be used for the key type K. 


EXAMPLE 4.14 Using the EnumMap Class 


Portugal PT Portuguese 92,391 10,605,870 38.5 
Sweden SE _ Swedish 449,964 9,016,596 40.9 


public class TestEnumMap { 


1 

2 public static void main(String[] args) { 

3 Map<EuCodes,Country> eu = new EnumMap<EuCodes,Country>(EuCodes.class) ; 

4 eu.put(EuCodes.AT, new CountryC"Austria"”, "German", 83870, 8192880, 40.9)); 
5 eu.put(EuCodes.PL, new Country("Poland", "Polish", 312685, 38536869, 37.0)); 
6 eu.put(EuCodes.FR, new Country("France", "French" , 547030, 60876136, 39.1)); 
7 System.out.printIn(eu.size()); 

8 System.out.printIn(eu.keySet(Q)); 

9 System.out.printInCeu.getCEuCodes.PL)); 

10 } 

11 } 

12 

13 enum EuCodes { AM, AT, BY, BE, BG, HR, CY, CZ, DK, EE, FI, FR, GE, 

14 DE, GR, HU, IS, IE, IT, LV, LI, LT, LU, MK, MT, MD, MC, NL, 

15 NO, PL, PT, RO, RU, SK, SI, ES, SE, CH, TR, UA, GB, VA } 

16 

17 class Country { 

18 private String name; 


19 private String language; 
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20 private int area; 

21 private int population; 
22 private double avAge; 

23 

24 Country(String name, String lang, int area, int pop, double avAge) { 
25 this.name = name; 

26 this. language = lang; 
27 this.area = area; 

28 this.population = pop; 
29 this.avAge = avAge; 

30 } 
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Method Summary 


void) clear () 
Removes all of the mappings from this map (optional operation). 
boolean|containsKey (Object key) 
Returns true if this map contains a mapping for the specified key. 


boolean! containsValue (Object value) 
Returns true if this map maps one or more keys to the specified value : 
Set<Map .Entry<K,V>>/entrySet () 
Returns a Set view of the mappings contained in this map 


boolean|equals (Object 0) 
Compares the specified object with this map for equality. 


Yl get (Object key) 
Returns the value to which the specified key is mapped, or null ifthis map contains no 
mapping for the key. 
int | hashCode () 
Returns the hash code value for this map. 
boolean! isEmpty () 
Returns true if this map contains no key-value mappings. 
Set<K> lkeySet () 
Returns a Set wiew of the keys contained in this map. 
¥)put (K key, V value) 
Associates the specified value with the specified key in this map (optional operation). 
void|putAll (Map<? extends K,? extends ¥> m) 
Copies all of the mappings from the specified map to this map (optional operation). 
¥lremove (Object key) 
Removes the mapping for a key from this map if it is present (optional operation). 
int] size () 
Returns the number of key-value mappings in this map. 
Collection<V> | values () 
Returns a Collection wew of the values contained in this map. 


Figure 4.11 The java.util .Map interface 
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32 public String toString( { 
33 return "[" + name + ": " + language + ", " + area +", " + population 
34 +", " + avAge + "J]"; 
35 } 
36 } 
The output is: 
3 
[AT, FR, PL] 


[Poland: Polish, 312685, 38536869, 37.0] 
The program uses two other classes: EuCodes at lines 13-15, and Country at lines 17-36. These are 
the type arguments for the type parameters K and V at line 3. 
Three key-value pairs are inserted at lines 3-6. The size(), keySet(), and get() methods are tested 
at lines 7-9. The get() method illustrates the look-up table nature of a map: the call 
eu.get (EuCodes. PL) 
looks up the record (the “value”’) for the key EuCodes. PL and returns it as: 
[Poland: Polish, 312685, 38536869, 37.0] 


The JCF implements four extensions of the AbstractMap class: the EnumMap class, the 
HashMap class, the LinkedHashMap class, and the TreeMap class. Their distinctions are the same 
as the distinctions among the corresponding four extensions of the AbstractSet class. The 
HashMap class allows any class to be the key type. Its LinkedHashMap extension maintains a 
doubly linked list of all its elements, allowing ordered key access according the order of inser- 
tion. The TreeMap class allows ordered key access according to either the key type’s natural 
order or a Comparator object passed to its constructor. 


THE Arrays CLASS 


The java.util.Arrays class contains static methods for processing arrays, including 
methods for converting arrays to collections and vice versa. The class is briefly outlined on 
page 29. 

Most of the methods in the Arrays class are overloaded with separate versions for the generic 
array type T[] and for each of the nine specific array types: boolean[], byte[], char[], 
double[], Float[], intL], long[], Object[], and short[]. In addition, most of these have a 
separate version for a subarray. For example, here are the 18 overloaded versions of the sort () 
method: 


void sort(boolean[] a) 

void sort(boolean[] a, int fromIndex, int toIndex) 
void sort(byte[] a) 

void sort(byte[] a, int fromIndex, int toIndex) 
void sort(c[] a) 

void sort(byte[] a, int fromIndex, int toIndex) 
void sort(double[] a) 

void sort(double[] a, int fromIndex, int toIndex) 
void sortCint[] a) 

void sortCint[] a, int fromIndex, int toIndex) 
void sort(float[] a) 

void sort(float[] a, int fromIndex, int toIndex) 
void sort(long[] a) 


void sort(long[] a, int fromIndex, int toIndex) 
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void sort(Object[] a) 

void sort(Object[] a, int fromIndex, int toIndex) 
void sort(short[] a) 

void sort(short[] a, int fromIndex, int toIndex) 
<T> void sort(T[] a, Comparator<? super T> c) 

<T> void sort(’[] a, int fromIndex, int toIndex, 


Comparator<? super T> c) 
(There is no sort() method for boolean arrays.) 
The eight different method() categories are: 
binarySearch() 
copyOFf () 
copyOfRange() 
equalsQ 
Fill1O 
hashCode() 
sortd 
toString() 
In addition, there is a single asList() method that returns an ArrayList collection containing 
the elements passed to it, like this: 
List<String> list = Arrays.asListC"CA", "US", "MX"); 


THE Collections CLASS 


The java.util.Collections class provides over 50 static utility methods that implement 
algorithms for sorting, searching, shuffling, and maintaining collections, among other tasks. 


EXAMPLE 4.15 Using Utility Methods from the Collections Class 


This program illustrates the addA11(), swap(), sort(), binarySearch(), and reverse() methods 


that are defined in the Collections, class: 
1 public class TestCollections { 


2 public static void main(String[] args) { 

3 List g8 = new ArrayListQ); 

4 Collections.addAl1(g8, "US", "DE", "JP", "FR", "GB", "RU", "CA", "IT"); 
5 System.out.printIn(g8) ; 

6 Collections.swap(g8, 2, 4); 

7 System.out.printIn(g8) ; 

8 Collections.sort(g8) ; 

9 System.out.printIn(g8) ; 

10 int k = Collections.binarySearch(g8, "CN"); 
11 System. out.printIn(k) ; 

12 if «k < 0) { 

13 g8.add(-k - 1, "CN"); 

14 3; 

15 System.out.printIn(g8) ; 

16 Collections. reverse(g8) ; 

17 System.out.printIn(g8) ; 

18 } 

19 } 


The output is: 
[US, DE, JP, FR, GB, RU, CA, IT] 
[US, DE, GB, FR, JP, RU, CA, IT] 
[CA, DE, FR, GB, IT, JP, RU, US] 
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-2 
[CA, CN, DE, FR, GB, IT, JP, RU, US] 
[US, RU, JP, IT, GB, FR, DE, CN, CA] 
At line 4, the addA11() method is used to load eight strings into the empty list g8. At line 6, the "JP" 
and "GB" elements are swapped, and then the list is sorted at line 8. 
At line 10, the binarySearch() method searches for the string "CN". The negative output signals that 
it is not there. The value of k tells where to insert it to keep the list sorted: at index -k - 1. 
Finally, at line 16, the reverse() method reverses the entire list. 


AUTOBOXING 


Java 5.0 introduced autoboxing, which means that a primitive value can be added to a collec- 
tion without explicitly “wrapping” it in an object; it will be wrapped, or “boxed” automatically. It 
will also be unwrapped when extracted. This is illustrated in Example 4.16. 


EXAMPLE 4.16 Using Autoboxing 


This program creates a list of five integers as an ArrayList<Integer> object: 


1 import java.util.*; 

2 

3 public class TestAutoboxing { 

4 public static void main(String[] args) { 

5 List<Integer> list = new ArrayList<Integer>Q ; 

6 Collections.addAl1Clist, 22, 33, 44, 55, 66); 

7 System.out.printfC"list: %s%n", list); 

8 System.out.printfC"list.sizeQ: %s%n", list.sizeQ)); 
9 System.out.printfC"list.get(2): %s%n", list.get(2)); 
10 int n = list.get(2); 
11 System.out.printf("n: %d%n", nN); 
12 list.remove(2); 
13 System.out.printfC"list: %S%n", List); 
14 System.out.printfC"list.sizeQ: %s%n", list.sizeQ); 
15 list.remove(new Integer(66)); 
16 System.out.printfC"list: %s%n", List); 
17 System.out.printfC"list.sizeQ): %s%n", list.sizeQ); 
18 } 
19 } 

The output is: 

list: [22, 33, 44, 55, 66] 


list.sizeQ: 5 
list.get(2): 44 


n: 44 

list: [22, 33, 55, 66] 
list.sizeQ: 4 

list: [22, 33, 55] 


list.sizeQ: 3 

At line 6, the addA11Q method from the Collections class inserts five Integer objects into list. 
The primitive int values, 22, 33, 44, 55, and 66 are “boxed” automatically, instantiating the five wrapper 
objects to hold them. 

At line 10, the element at index 2, which is the Integer object that holds the primitive int value 44, is 
accessed by the get() method. It is automatically “unboxed” during its assignment to the int variable n. 

The remove() method is overloaded in the JCF: One version takes an int argument that specifies the 
position of the element to be removed, and one version takes a reference to an object that equals() the 
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element to be removed. These two versions are invoked at lines 12 and 15. At line 12, the element at index 
2, which is the object that holds 44, is removed. At line 15, the element that equalsQ the object that 
holds 66 is removed. 
Note that this would not work: 
list.remove(22); // ERROR: the argument refers to index 22 
Autoboxing occurs only when the types would otherwise be mismatched. 


Autoboxing, which might better be called “autowrapping,” also works in other contexts where 
an explicit instantiation of a wrapper class would otherwise be required. For example: 
Double pi = 3.14159; // OK 
Note that automatic implicit instantiation (also called promotion) also occurs when a primitive 
expression is combined with a String object by means of the concatenation operator +. For 
example: 
System.out.printInC"circumference = " + 2*pi*r); // OK 
In this case, the value of the double expression 2*pi*r is automatically wrapped in a new 
Double object, whose toString() method is then invoked to produce the String object that is 
concatenated with the String literal "circumference = ". This suggests that the alternative, 
System.out.printfC"circumference = %f%n", 2*pi*r); // BETTER 
should be preferred. 


Review Questions 


4.1. What is the Java Collections Framework? 

4.2. What are the four main types defined in the Java Collections Framework? 

4.3. What is a queue? 

4.4 What is a deque? 

4.5 What is a legacy class? 

4.6 Which collection classes are implemented with an indexed structure (i.e., a simple array)? 
4.7 Which collection classes are implemented with a linked structure? 

4.8 Which collection classes are implemented with a linked indexed structure? 

4.9 What is an ArrayList object? 

4.10 What is aLinkedList object? 

4.11 What is the Collection interface? 

4.12 What set-theoretic methods are supported by the Java Collections Framework? 
4.13 What is an iterator? 

4.14 How are iterators similar to array indexes? 

4.15 How are iterators different from array indexes? 

4.16 Howcan the Arrays class be used to initialize a collection? 

4.17 Howcan the Collections class be used to initialize a collection? 

4.18 What is a generic class? 

4.19 What is a generic method? 
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4.20 
4.21 
4.22 
4.23 


4.1 


4.2 


4.3 


4.4 


4.5 


4.6 


4.1 


4.2 
4.3 


4.4 


4.5 


What is a generic type parameter? 
What is a constrained generic type parameter? 
What is a generic type argument? 


What is autoboxing? 


Problems 


Write and test this method: 
int chars(List<String> strings) 
// returns the total number of characters in all the strings 


Write and test this method, using an iterator: 

void printCCollection c) 

// prints all the elements of the collection c 
Note that this has the same effect as in Example 4.4 on page 76. 


Write and test this generic method: 

<E> int frequency(Collection<E> c, Ee) f{ 

// returns the number of occurrences of e inc 
a. using an iterator. 
b. using an enhanced for loop. 


Write and test this generic method: 
<E> E getLast(List<E> list) { 
// returns the last element of the list 
a. using an iterator. 
b. using an enhanced for loop. 


Write and test this generic method: 
<E> E getElementAt(List<E> list, int index) { 
// returns the list element at the specified index 
a. using an iterator. 
b. using an enhanced for loop. 


Write and test this client method: 
<E> Collection<E> toCollection(E[] a) 
// returns a generic Collection that contains the elements 
// of the specified array 


Answers to Review Questions 


The Java Collections Framework is a group of interrelated interfaces and classes that support the cre- 
ation and use of lists, sets, and iterators. 


The four main types are List, Queue, Set, and Map. 


A queue is a first-in-first-out collection that allows elements to be added at only one end (the back) 
and removed only from the other end (the front). 


A deque is a double-ended queue that allows elements to be added and removed only from its two ends 
(the front and back). 


A legacy class is a class that has been superseded by another class. 
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The ArrayList, ArrayDeque, ArrayDeque, and HashMap classes are implemented with an array. 


The LinkedList, PriorityQueue, TreeSet, and TreeMap classes are implemented with a linked 
structure (either a linked list or a tree). 


The LinkedHashSet class and the LinkedHashMap class are implemented with a linked indexed 
structure. 


An ArrayList object is an instance of the java.util .ArrayList class. As a member of the Java 
Collections Framework, it supports all the methods of the Collection interface. As an indexed struc- 
ture, it provides direct access to its elements by their index numbers. 


A LinkedList object is an instance of the java.util.LinkedList class. As a member of the 
Java Collections Framework, it supports all the methods of the Collection interface. As a linked 
structure, it allows elements to be added and removed in constant time, independent of the size of the 
list. 


The Collection interface is a Java interface that specifies the 15 methods listed in Figure 4.2 on 
page 71. 
The four set-theoretic methods specified by the Col ]lection interface are listed in Figure 4.2 on page 


71: addA11() for unions, removeA11() for complements, retainAl1() for intersections, and 
containsA11() for testing the subset relation. 


An iterator is an object that moves about on a collection, providing access to its elements. 


An iterator can be used to traverse a collection with a for loop, in the same way that an index can be 
used to traverse an array. 


An array index provides direct access (i.e., random access) into an array. An iterator provides only 
sequential access. 


The java.util .Arrays class includes an asList() method that returns a List object whose ele- 
ments are the same as those in the array passed to it. (See Example 4.15 on page 96.) 


The java.util.Collections class includes an addA11() method that adds all elements that are 
passed to it. (See Example 4.15 on page 96.) The method takes a variable number of arguments (.e., it 
is a varargs method). 


A generic class is a class that uses generic type parameters, that is, parametrized types. (See page 96.) 
A generic method is a method that uses generic type parameters, that is, parametrized types. 


A generic type parameter is a symbol used in a class or method definition that represents a type. When 
applied, an actual class or interface name must be substituted for the parameter. 


A constrained generic type parameter is a generic type parameter that is constrained by an extends 
clause. 


A generic type argument is a class or interface name that is being substituted for a generic type param- 
eter in a generic class or generic method. 


The term “autoboxing” refers to the automatic implicit instantiation of a wrapper class thereby allow- 
ing the value of a fundamental type expression to be used where an object is expected, for example, in 
the Col lections.addA11() method. (See Example 4.16 on page 97.) 


Solutions to Problems 


int chars(List<String> strings) { 
int chars = 0; 
for (String s : strings) { 
chars += s.lengthQ); 
} 


return chars; 
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4.2 

4.3 a. 
b. 

4.4 a. 
b. 

4.5 a. 
b. 
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} 


void print(Collection c) { 
for (Iterator it = c.iterator(); it.hasNext(); ) { 


System.out.printf("%s ", it.nextQ); 


} 
System.out.printInQ; 

} 

<E> int frequency(Collection<E> c, E e) { 
int f = 0; 


for (Iterator<E> it = c.iterator(); it.hasNext(); ) { 
if Cit.nextQ.equals(e)) { 
++F; 
} 
} 
return f; 
} 
<E> int frequency(Collection<E> c, E e) { 
int f = 0; 
for (E ce: c) { 
if (ce.equals(x)) { 
++F; 
} 
} 
return f; 


} 


<E> E getLast(List<E> list) { 
E last = null; 
for (Iterator<E> it = list.iterator(); it.hasNextQ); ) { 
last = it.nextQ); 
} 
return last; 
} 
<E> E getLast(List<E> list) { 
E last null; 
for CE : list) { 
last e; 


I @ il 


5 


return last; 
} 


<E> E getElementAt(List<E> list, int index) { 
E element = null; 
int i=0; 
for (Iterator<E> it = list.iterator(); it.hasNext() && i++<=index;) { 

element = it.nextQ); 

} 
return element; 

} 

<E> E getElementAt(List<E> list, int index) { 

E element = null; 


int i=0; 
for (Ee: list) { 
if Cit+ == index) { 
return e; 
} 
} 


return null; 
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4.6 <E> Collection<E> toCollection(E[] a) { 
Collection<E> c = new ArrayList<E>Q; 
for (E ae: a) { 

c.add(ae) ; 
} 
return Cc; 


} 
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Stacks 


A stack is a collection that implements the last-in-first-out (LIFO) protocol. This means that 
the only accessible object in the collection is the last one that was inserted. A stack of books is a 
good analogy: You can’t take a book from the stack without first removing the books that are 
stacked on top of it. 


STACK OPERATIONS 


The fundamental operations of a stack are: 
1. Add an element onto the top of the stack. 
2. Access the current element on the top of the stack. 
3. Remove the current element on the top of the stack. 
These three operations are usually named push, peek, and pop, respectively. 


THE JCF Stack CLASS 


As shown in Figure 4.1 on page 70, the Java Collections Framework includes a Vector class, 
which is a specialized List type. Prior to the development of the JCF, Java extended the Vector 
class to a Stack class. But that class is now considered obsolete because it isn’t consistent with 
the JCF. Instead, the Java API recommends using the ArrayDeque class for stacks, like this: 

Deque<String> stack = new ArrayDeque<String>Q() ; 
This provides all the normal functionality of a stack, while being consistent with the JCF. (See 


page 91.) 
EXAMPLE 5.1 Testing a String Stack 


public class TestStringStack { 
public static void main(String[] args) { 

Deque<String> stack = new ArrayDeque<String>() ; 
stack.push("GB"); 
stack.push("DE"); 
Stack.pushC("FR"); 
stack.pushC("ES"); 
System. out.printIn(stack) ; 
System.out.printInC"stack.peek(): 


oO ON Do F&F WHY = 


+ stack.peekQ); 
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10 System.out.printInC"stack.pop(Q): " + stack.popQ); 

1 System.out.printIn(stack) ; 

12 System.out.printInC"stack.pop(Q): " + stack.popQ); 

13 System.out.printIn(stack) ; 

14 System.out.printInC"stack.push(\"IE\"): "); 

15 stack.pushC("IE"); 

16 System.out.printIn(stack) ; 

17 } 

18 } 


The output is: 


[ES, FR, DE, GB] 
stack.peek(Q): ES 
stack.pop(): ES 


[FR, DE, 


GB] 


stack.popQ): FR 


[DE, GB] 


stack.pushC("IE"): 


[IE, DE, 


GB] 


The push, peek, and pop operations are illustrated at lines 4, 9, and 10, respectively. 


A Stack INTERFACE 


The operational requirements of a stack (peek, pop, and push) can be formalized as a Java 


interface: 


EXAMPLE 5.2 A Stack Interface 


N OO fF WY = 


} 


public interface Stack<E> { 
public boolean isEmpty(); 
public E peek(); 
public E pop(); 
public void push(E element); 
public int sizeQ; 


The symbol E is a type parameter. (See page 76.) It stands for the deferred element type for the stack. 
In addition to the three required stack operations, this interface also specifies an isEmpty() method 


and a size() method. 


The Stack interface shown in Example 5.2 is a generic type. (See page 76.) The expression 
<E> at line | means that the element type E is unspecified. 


AN INDEXED IMPLEMENTATION 


There are several ways to implement the Stack interface. The simplest is an indexed imple- 
mentation, using an ordinary array. 


EXAMPLE 5.3 An ArrayStack Implementation 


public class ArrayStack<E> implements Stack<E> { 
private E[] elements; 


private static final int INITIAL_CAPACITY = 100; 


1 
2 
3 private int size; 
4 
5 
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6 public ArrayStackQ) { 

7 elements = (E[]) new Object [INITIAL_CAPACITY] ; 
8 } 

9 

10 public ArrayStack(int capacity) { 

11 elements = (E[]) new Object[capacity]; 

12 } 

13 

14 public boolean isEmptyQ { 

15 return (size == 0); 

16 } 

17 

18 public E peekQ) { 

19 if (size == 0) { 

20 throw new java.util.EmptyStackException() ; 
21 } 

22 return elements[size-1]; // top of stack 
23 } 

24 

25 public E popd { 

26 if (size == 0) { 

27 throw new java.util.EmptyStackExceptionQ() ; 
28 } 

29 E element = elements[--size]; 

30 elements[size] = null; 

31 return element; 

32 } 

33 

34 public void push(E element) { 

35 if (size == elements.length) { 

36 resizeQ); 

37 } 

38 elements[size++] = element; 

39 } 

40 

4 public int sizeQ { 

42 return size; 

43 } 

44 

45 private void resize() { 

46 assert size == elements. length; 

47 Object[] a = new Object[2*size]; 

48 System.arraycopy(elements, 0, a, 0, size); 
49 elements = CE[])a; 

50 } 

51 } 


Since this class is implementing the generic type Stack<E>, it too is a generic type. Thus, the type 
parameter E is used wherever the element type has to be specified. For example, at line 29, the local 
variable element is declared to have type E. 

This ArrayStack implementation uses a “backing array” elements[] to store the stack’s elements. It is 
declared at line 2 to have type E[], that is, an array of elements of type E. The other class field is the 
integer size, which keeps a count of the number of elements in the stack. 

The class defines the static constant INITIAL_CAPACITY at line 4 to be the number 100. This is used 
only to specify the initial size of the backing array at line 7. The choice of 100 is arbitrary; any reasonably 
small positive integer would suffice. 
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Two constructors are defined: the default (no-argument) constructor at line 6 and the one-argument 
constructor at line 10. Both merely allocate the backing array, using either the default INITIAL_CAPACITY 
(line 7) or a user-specified capacity (line 11). 

Notice how arrays are allocated in a generic collection: 

7 elements = (E[]) new Object [INITIAL_CAPACITY] ; 
Since generic arrays are not supported in Java, this simpler approach 
elements = new E[INITIAL_CAPACITY]; // ERROR! 
will not compile. Instead, we have to allocate the backing array as an array of Object elements, and then 
cast that array with (E[]) in order to assign it to the elements field, which has type E[]. This subterfuge 
may generate a compiler warning, which can be ignored. 

The implementations of the five methods required by the Stack interface are pretty straightforward. 
The size() method at line 41 is an accessor method that simply returns the value of the size field. The 
isEmptyQ method at line 14 returns true or false according to whether or not the value of size is 0. 

The peek(Q) method at line 18 throws an exception if the stack is empty. Otherwise, it returns the top 
element on the stack: elements[size-1]. 

The pop() method at line 25 is almost the same as the peek() method. But before returning the top 
element, it must remove it by decrementing the size field at line 29 and replacing the top element with null 
at line 30. This last step is done so that no references will be left to an inaccessible object. 

The push() method at line 34 is essentially the reverse of the pop() method. At line 38, the element 
is added to the array at the next available position, at elements [size], and then the size counter is post- 
decremented. 

If the array is full, then the size counter will equal elements. length. If that condition holds when 
push() is called, the array is doubled by a call to the private resize() method at line 36. This creates 
the temporary array a[] at line 47, copies all the elements into it at line 48, and then assigns the elements 
field to it at line 49. 

Note the use of the assert statement at line 46. If the condition size == elements. 1ength does not 
hold at that point, the program will abort with an error message, just like an uncaught exception. Of 
course, that should never happen, because the resize() method is private, and the only place where it 
is called within this class is at line 36, inside the if block for that same condition. The purpose of includ- 
ing the assert statement here is merely “insurance,” to guard against possible future modifications of the 
class that might inadvertently involve the resize() method when that condition is not true. 


The ArrayStack class implemented in Example 5.3 can be tested the same way the Stack 
class is tested in Example 5.1 on page 103. The push(), peek), and pop) calls should work 
the same way. The print1nQ) calls cannot be executed for the ArrayStack class because it has 
no toString() method. But that’s proper, because a stack really should not allow access to any 
of its elements other than the one on top. 


A LINKED IMPLEMENTATION 
The main alternative to an indexed implementation is a linked implementation, using a linked 
list. (See Chapter 3.) 


EXAMPLE 5.4 A LinkedStack Implementation 


1 public class LinkedStack<E> implements Stack<E> { 
2 private Node<E> head = new Node<E>(); // dummy node 
3 private int size; 
4 
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5 public boolean isEmptyQ { 

6 return (size == 0); 

7 } 

8 

9 public E peekQ) { 

10 if (size == 0) { 

11 throw new java.util.EmptyStackException() ; 
12 } 

13 return head.prev.element; // top of stack 
14 } 

15 

16 public E popd { 

17 if (size == 0) { 

18 throw new java.util.EmptyStackException() ; 
19 } 

20 E element = head.prev.element; 

21 head.prev = head.prev.prev; 

22 head.prev.next = head; 

23 --size; 

24 return element; 

25 } 

26 

27 public void push(E element) { 

28 head.prev = head.prev.next = new Node<E>(element, head.prev, head); 
29 ++51Ze; 

30 } 

31 

32 public int sizeQ { 

33 return size; 

34 } 

35 

36 private static class Node<E> { 

37 E element; 

38 Node<E> prev; 

39 Node<E> next; 

40 

4 Node() { 

42 this.prev = this.next = this; 

43 } 

44 

45 Node(E element, Node<E> prev, Node<E> next) { 
46 this.element = element; 

47 this.prev = prev; 

48 this.next = next; 

49 } 

50 } 

51 } 


This class implements a doubly linked list, using the private inner Node class defined at lines 36—50. 
Each node contains an element field and two link fields: one pointed to the previous node in the list and 
one pointing to the next node. The constructor defined at line 41 constructs a dummy node with nul] 
element and with both links pointing to the node itself. The three-argument constructor defined at line 45 
allows all three fields to be initialized. 

The LinkedStack class defines two fields at lines 2—3: its head node link and its size counter. Note 
that this implements the empty stack as a single dummy node with its prev and next links pointing to the 
node itself. The advantage of self-pointers is that we avoid nu11 pointers, obviating special cases in the 
push() and pop() methods. 


108 STACKS [CHAP. 5 


When the stack is not empty, the top element will always be in the node referenced by the head. prev 
link. Thus, both peek() and pop() return head. prev.element. 

To remove the top element, pop() deletes the node that contains it. This requires resetting two pointers, 
which is done at lines 21-22: 

21 head.prev = head.prev.prev; 
22 head.prev.prev.next = head; 

The push() method constructs a new node containing the specified element at line 28, and then it 
resets both the head. prev.next and the head. prev links to point to it. Note that the chained assignment 
works from right to left, first assigning the new node reference to head.prev.next and then to 
head. prev. That order of operations is critical because the existing head. prev.next node is not accessi- 
ble after head. prev is changed. 


ABSTRACTING THE COMMON CODE 


The ArrayStack and LinkedStack implementations of the Stack interface are quite differ- 
ent. Nevertheless, they do have some identical code. The size() method and the isEmpty() 
method are the same. Their implementation does not depend upon whether the backing structure 
is indexed or linked. 

When parts of an implementation are independent of the underlying data structure, it is advan- 
tageous to implement those parts separately in an abstract class. 


EXAMPLE 5.5 An AbstractStack Class 


public abstract class AbstractStack<E> implements Stack<E> { 


1 

2 protected int size; 

3 

4 public boolean isEmpty() { 
5 return (size == 0); 

6 } 

7 

8 abstract public E peekQ; 
9 

10 abstract public E popQ; 

11 

12 abstract public void push(E element) ; 
13 

14 public int sizeQ { 

15 return size; 

16 } 

17 } 


The three methods that depend upon the implementation’s backing structure are declared abstract 
(lines 8, 10, and 12). This of course requires the class itself to be declared abstract. 

Notice at line 2 that the size field is declared protected instead of private so that it can be accessed 
by the extending concrete classes. 


With the AbstractStack class implemented as shown in Example 5.5, we can now simplify 
the two concrete implementations from Example 5.3 on page 104 and Example 5.4 on page 106. 
We need only add the expression extends AbstractStack<E> to each class header, and then we 
can remove their isEmpty() and size() methods and their declaration of the size field. This of 
course is the strategy used by the JCF. (See Figure 4.1 on page 70.) 
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APPLICATION: AN RPN CALCULATOR 


Although the stack data structure is one of the simplest, it is essential in certain important 
applications. Some of these are illustrated in the following examples. 

An arithmetic expression is said to be in postfix notation (also called reverse Polish notation, 
or RPN) if each operator is placed after its operands. For example, the postfix expression for 
3*(4 + 5)is3 4 5 + *. [The expression 3*(4 + 5) is called an infix expression.] Postfix 
expressions are easier to process by machine than are infix expressions. Calculators that process 
postfix expressions are called RPN calculators. 


EXAMPLE 5.6 An RPN Calculator 


This program parses postfix expressions, performing the indicated arithmetic. It uses a stack to 
accumulate the operands. 

1 public class Calculator { 

2 public static void main(String[] args) { 

3 Deque<String> stack = new ArrayDeque<String>() ; 

4 Scanner in = new Scanner (System. in); 

5 while (true) { 

6 String input = in.nextLineQ; 

7 char ch = input.charAt(O); 

8 if (ch == '+' || ch == '-' [| ch == '*' || ch == '/') ff 
9 


double y = Double.parseDouble(stack.pop()); 
10 double x = Double.parseDouble(stack.pop()); 
1 double z = 0; 
12 switch (ch) { 
13 case '+': z= xX + y;_ break; 
14 case '-': z=x - y;_ break; 
15 case '*' z= xX * y;_ break; 
16 case '/': z=x/y; 
17 } 
18 System.out.printfC"\t%.2f %c %.2Ff = %.2f%n", x, ch, y, Z); 
19 stack.push(new Double(z).toString()); 
20 } else if (ch == 'q' || ch == 'Q') { 
21 return; 
22 } else { 
23 stack. push(Cinput) ; 
24 } 
25 } 
26 } 
27 } 
Here is one run: 

3 

4 

5 

+ 

4.00 + 5.00 = 9.00 
3.00 * 9.00 = 27.00 
10 
/ 


27.00 / 10.00 = 2.70 


2.70 - 1.00 = 1.70 
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At line 3, the program instantiates a stack of strings, like the one in Example 5.1 on page 103. Then it 
goes into an infinite while loop at line 5, interactively reading a string on each iteration at line 6. 

If the user inputs anything other than +, -, *, /, q, or Q, the program assumes it is a number and pushes 
it onto the stack at line 23. If it is one of the four arithmetic operations, then the top two numbers are 
popped off the stack at lines 9-10 and the operation is applied to them. The result is printed at line 18 and 
then pushed onto the stack at line 19. 

The program ends when the user enters q or Q (for “quit’’). 


Human readers tend to prefer infix to postfix notation for arithmetic expressions. The follow- 
ing example converts a given infix expression to postfix. 


EXAMPLE 5.7 Converting Infix to Postfix 


1 public class TestScanner { 

2 public static void main(String[] args) { 

3 Deque<String> stack = new ArrayDeque<String>() ; 
4 String line = new Scanner(System.in).nextLine() ; 
5 System.out.printIn(line); 

6 Scanner scanner = new Scanner(line); 

7 while (scanner.hasNext()) { 

8 if (scanner.hasNextIintQ)) { 

9 System.out.print(scanner.nextIntQ +" "); 
10 t+ else { 

11 String str = scanner. nextQ); 

12 if C"+-*/".indexOf(str) >= 0) { 

13 stack.push(str); 

14 } else if (str.equals(")")) { 

15 System.out.print(stack.pop() +" "); 

16 } 

17 } 

18 } 

19 while (!stack.isEmpty()) { 

20 System.out.print(stack.pop() +" "); 

21 } 

22 System.out.printInQd; 

23 } 

24 } 


The output is: 
¢ 80 - 30 ) * ¢ 40 + 50 / 10 ) 
80 30 - 40 50 10 / + * 

The output shows that the program parsed the infix expression ( 80 - 30 ) * ( 40+ 50 / 10) to 
generate its postfix equivalent 80 30 - 405010/+*. 

The program uses a stack, declared at line 3, and a scanner, declared at line 6. The scanner extracts 
integer and string tokens from the input line. If the token is an integer, it is printed immediately, at line 9. 
If it is one of the four strings "+", "-","*", or "/", it is pushed onto the stack at line 13. If it is the strings 
")", then the top element of the stack is popped off and printed at line 15. After the input line has been 


completely parsed, the remaining elements are popped from the stack and printed at lines 19-21. 


Review Questions 


5.1 Why are stacks called LIFO structures? 


5.2. Would it make sense to call a stack 
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5.3 


5.4 


5.1 


5.2 


5.3 
5.4 


5.5 
5.6 


5.7 


a. aLILO structure? 
b. a FILO structure? 


What is 
a. prefix notation? 
b. infix notation? 
c. postfix notation? 


Determine whether each of the following is true about postfix expressions: 
a xytzt=xyztd 


Xytz—=xXyz—-Tt 


b. 
c xy-zt=xyzt+— 
d. xy-z-=xyz-- 


Problems 


Trace the following code, showing the contents of the stack after each invocation: 


Stack stack = new Stack(Q); 
stack.push(new Character('A') 
) 


); 
stack.push(new Character('B')); 
stack.push(new Character('C')); 
stack.popQ() ; 
stack. pop) ; 
stack.push(new Character('D')) 
stack.push(new Character('E')) 
stack.push(new Character('F')) 
stack.popQ() ; 
stack.push(new Character('G')); 
stack.pop() ; 
stack.popQ) ; 


stack.popQ) ; 


Translate each of the following prefix expressions into infix: 
a. —/+* abcde 
b. /-—ab*ct+de 
c. /at+tb*c-de 


Translate the prefix expressions in Problem 5.2 into postfix. 


Translate each of the following infix expressions into prefix: 
a. (at+b)-(c/(dt+e)) 
b. a/((b/c)*(d-e)) 
ce. (a/(b/c))*(d-e) 


Translate the infix expressions in Problem 5.4 into postfix. 


Translate each of the following postfix expressions into prefix: 
a. ab+cd-—/et 
b. abc+de-* —- 
ce. abcde//// 


Translate the postfix expressions in Problem 5.6 into infix. 
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5.9 


5.10 


5.11 


5.12 


5.13 


5.14 


5.15 


5.16 


5.17 


5.18 


5.19 


5.20 


5.21 


5.22 
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Write this client method using only the push(), peek(), pop(), and isEmpty() methods: 
public static <E> Deque<E> reversed(Deque<E> stack) 
// returns a new stack that contains the same elements as the given 
// stack, but in reversed order 


Write this client method using only the push(), peek(), pop), and isEmpty() methods: 
public static <E> Deque<E> reversed(Deque<E> stack) 
// returns a new stack that contains the same elements as the given 
// stack, but in reversed order, and leaves the given stack in its 
// original state 


Write this client method using only the push(), peek(), pop(), and isEmpty() methods: 
public static <E> void reverse(Deque<E> stack) 
// reverses the contents of the specified stack 


Write this client method using only the push(), peek(), pop), and isEmpty() methods: 
public static <E> E penultimate(Deque<E> stack) 
// returns the second from the top element of the specified stack 


Write this client method using only the push(), peek), pop(), and isEmpty() methods: 
public static <E> E popPenultimate(Deque<E> stack) 
// removes and returns the second element of the specified stack 


Write this client method using only the push(), peek(), pop), and isEmpty() methods: 
public static <E> E bottom(Deque<E> stack) 
// returns the bottom element of the specified stack 


Write this client method using only the push(), peek(), pop), and isEmpty() methods: 
public static <E> E popBottom(Deque<E> stack) 
// removes and returns the bottom element of the specified stack 


Add this member method to the ArrayStack class shown in Example 5.3 on page 104: 
public void reverse() 
// reverses the contents of this stack 


Add this member method to the LinkedStack class shown in Example 5.4 on page 106: 
public void reverse() 
// reverses the contents of this stack 


Add this member method to the ArrayStack class shown in Example 5.3 on page 104: 
public E penultimateQ 
// returns the second from the top element of this stack 


Add this member method to the LinkedStack class shown in Example 5.4 on page 106: 
public E penultimate() 
// returns the second from the top element of this stack 


Add this member method to the ArrayStack class shown in Example 5.3 on page 104: 
public E popPenultimate() 
// removes and returns the second element of this stack 


Add this member method to the LinkedStack class shown in Example 5.4 on page 106: 
public E popPenultimate() 
// removes and returns the second element of this stack 


Add this member method to the ArrayStack class shown in Example 5.3 on page 104: 
public E bottom() 
// returns the bottom element of this stack 


Add this member method to the LinkedStack class shown in Example 5.4 on page 106: 
public E bottom() 
// returns the bottom element of this stack 
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Add this member method to the ArrayStack class shown in Example 5.3 on page 104: 


public E popBottom() 
// removes and returns the bottom element of this stack 


Add this member method to the LinkedStack class shown in Example 5.4 on page 106: 


public E popBottom() 
// removes and returns the bottom element of this stack 


Answers to Review Questions 


Stacks are called LIFO structures because the last element that is inserted into a stack is always the 
first element to be removed. LIFO is an acronym for last-in-first-out. 


a. 


b. 


Boe oP 


No, because a LILO structure would mean last-in-last-out, which is just the opposite of the “last- 
in-first-out” protocol. 

Yes, because a FILO structure would mean first-in-last-out, which is the same as a last-in-first- 
out protocol. 


The prefix notation for arithmetic expressions places binary operators ahead of both of their 
operands. For example, the expression “x + 2” is written “+ x 2” in prefix notation. The standard 
functional notation used in mathematics uses prefix notation: f(x), sinx, and so on. 

The infix notation for arithmetic expressions places binary operators between their operands. 
Infix notation is the usual format for arithmetic expressions, for example, x + 2. 

The postfix notation for arithmetic expressions places binary operators after both of their oper- 
ands. For example, the expression “x + 2” is written “x 2 +” in postfix notation. The factorial 
function in mathematics uses postfix notation: 7!. 


True, because (x + y)+z=x+(y+z). 
True, because (x + y)-—z=x+(y—Z). 

False, because (x —y) +z #x-(y +z). 
False, because (x — y) -—z#x-(y—z). 


Solutions to Problems 


eR oR ose 


eae 


eae 


v 


AB — ABC —> AB > A —> AD -> ADE —> ADEF —> ADE —> ADEG —- ADEF —> ADE 
(a*b+c)/d-e) 

(a=b)i(e* (adte)) 

al(b+(c*(d-e))) 

ab*ct+d/e- 

ab-cdet*/ 

abcde—-*+/ 


(a+b)-(c/(dt+e))=-+ab/ct+de 
al((b/c)*(d-e))=/a*/bc-—de 
(a/(b/c))*(d-e)=*/a/bc-de 
(at+b)-(c/(dte))=abt+cdet/-—- 
al((b/c)*(d-e))=abc/de-*/ 
(a/(b/c))*(d-e)=abc//*de-* 
(a+ b)/(c—d)+e 

a-(b+c)*(d-e) 
al(b/(cl(d/e))) 
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5.7 a. +/+ab-—cde 
b. -a*+bhc-de 
ce /alb/cide 


5.8 public static <E> Deque<E> reversed(Deque<E> stack) { 
// returns a new stack that contains the same elements as the given 
// stack, but in reversed order 
Deque<E> stackl = new ArrayDeque<E>(); 
while(!stack.isEmpty()) { 
stackl.push(stack.pop()); 
} 
return stack1; 


i 


5.9 public static <E> Deque<E> reversed(Deque<E> stack) { 
// returns a new stack that contains the same elements as the given 
// stack, but in reversed order, and leaves the given stack in its 
// original state 
Deque<E> stackl = new ArrayDeque<E>(); 
Deque<E> stack2 = new ArrayDeque<E>(); 
while(!stack.isEmpty()) { 
stackl.push(stack.peek()); 
stack2.push(stack.pop()); 
} 
while(!stack2.isEmpty()) { 
stack.push(stack2.pop()); 
} 
return stack1; 


} 


5.10 public static <E> void reverse(Deque<E> stack) { 

// reverses the contents of the specified stack 

Deque<E> stackl = new ArrayDeque<E>(); 

Deque<E> stack2 = new ArrayDeque<E>(); 

while(!stack.isEmpty()) { 
stackl.push(stack.pop()); 

} 

while(!stackl.isEmpty()) { 
stack2.push(stack1l.pop()); 


} 
while(!stack2.isEmpty()) { 
stack.push(stack2.pop()); 
} 
} 


5.11 public static <E> E penultimate(Deque<E> stack) { 
// returns the second from the top element of the specified stack 
E xl = stack.popQ; 
E x2 = stack.peekQ); 
stack. push(x1) ; 
return x2; 


} 


5.12 public static <E> E popPenultimate(Deque<E> stack) { 
// removes and returns the second element of the specified stack 
E xl = stack.popQ; 
E x2 = stack.popQ; 
stack. push(x1) ; 
return x2; 


} 


5.13 public static <E> E bottom(Deque<E> stack) { 
// returns the bottom element of the specified stack 
Deque<E> stackl = new ArrayDeque<E>(); 
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while(!stack.isEmpty()) { 
stack1.push(stack.pop()); 

} 

E x = stackl.peekQ; 

while(!stackl.isEmpty()) { 
stack.push(stack1.pop()); 

} 

return x; 


} 


5.14 public static <E> E popBottom(Deque<E> stack) { 
// removes and returns the bottom element of the specified stack 
Deque<E> stackl = new ArrayDeque<E>(); 
while(!stack.isEmpty()) { 

stackl.push(stack.pop()); 
} 
E x = stackl.popQ; 
while(!stackl.isEmpty()) { 
stack.push(stackl.pop()); 
} 
return x; 


} 


5.15 public void reverse() { 

// reverses the contents of the this stack 

for Cint i=0; i<size/2; i++) { 
E e = elements[i]; 
elements[i] = elements[size-1-i]; 
elements[size-1-i] = e; 

} 

} 


5.16 public void reverse() { 
// reverses the contents of the this stack 
Node<E> p = head; 
for Cint i=0; i<=size; i++) { 
Node<E> q = p.next; 
p.next = p.prev; 
p = p.prev = q; 
} 
} 


5.17 public E penultimate() { 
// returns the second from the top element of this stack 
if (size < 2) { 
throw new java.util .NoSuchElementException() ; 
} 
return elements[size-2]; 


} 


5.18 public E penultimate() { 
// returns the second from the top element of this stack 
if (size < 2) { 
throw new java.util .NoSuchElementException() ; 
} 
return head.prev.prev.element; // second from top of stack 


i 


5.19 public E popPenultimate() { 
// removes and returns the second element of this stack 
if (size < 2) { 
throw new java.util .NoSuchElementException() ; 
} 


E element = elements[size-2]; 
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elements[size-2] 
elements[size-1] 
--size; 

return element; 


elements[size-1]; 
null; 


public E popPenultimateQ) { 


i 


// removes and returns the second element of this stack 
if (size < 2) { 
throw new java.util .NoSuchElementException() ; 
} 
E element = head.prev.prev.element; 
head.prev.prev = head.prev.prev.prev; 
head.prev.prev.next = head.prev; 
--size; 
return element; 


public E bottom() { 


} 


// returns the bottom element of this stack 
if (size == 0) { 

throw new java.util.EmptyStackException() ; 
} 
return elements[0]; // bottom of stack 


public E bottom() { 


} 


// returns the bottom element of this stack 
if (size == 0) { 

throw new java.util.EmptyStackException() ; 
} 


return head.next.element; // bottom of stack 


public E popBottom() { 


} 


// removes and returns the bottom element of this stack 
if (size == 0) { 
throw new java.util.EmptyStackException() ; 
} 
E element = elements[0]; 
for Cint i=0; i<size-2; i++) { 
elements[i] = elements[i+1]; 
} 
elements[size-1] = null; 
--size; 
return element; 


public E popBottom() { 


// removes and returns the bottom element of this stack 
if (size == 0) { 

throw new java.util.EmptyStackException() ; 
} 

E element = head.next.element; 

head.next = head.next.next; 

head.next.prev = head; 

--size; 

return element; 
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CHAPTER 6 


Queues 


A queue is a collection that implements the first-in-first-out (FIFO) protocol. This means that 
the only accessible object in the collection is the first one that was inserted. The most common 
example of a queue is a waiting line. 


QUEUE OPERATIONS 


The fundamental operations of a queue are: 
1. Add an element to the back of the queue. 
2. Access the current element at the front of the queue. 
3. Remove the current element at the front of the queue. 
Some authors use the terms enqueue and dequeue for the add and remove operations. 


THE JCF Queue INTERFACE 


As shown in Figure 4.1 on page 70, the Java Collections Framework includes a Queue inter- 
face, which is implemented by four classes: the LinkedList class, the AbstractQueue class, 
the PriorityQueue class, and the ArrayDeque class. For simple FIFO queues, the ArrayDeque 
class is the best choice: 

Queue<String> queue = new ArrayDeque<String>() ; 
This provides all the normal functionality of a queue. (See page 91.) 


EXAMPLE 6.1 Testing a String Queue 


1 public class TestStringStack { 

2 public static void main(String[] args) { 

3 Queue<String> queue = new ArrayDeque<String>Q(Q ; 
4 queue.add("GB"); 

5 queue.add("DE"); 

6 queue.add("FR"); 

7 queue.add("ES"); 

8 System.out.println(Cqueue) ; 

9 System.out.print]nC"queue.element(): 


+ queue.element()); 


10 System.out.printlnC"queue.remove(): " + queue.remove()); 
1 System.out.println(Cqueue) ; 

12 System.out.printIlnC"queue.remove(): " + queue.remove()); 
13 System.out.printIlnCqueue) ; 

14 System.out.printInC"queue.add(\"IE\"): "); 

15 queue.add("IE"); 
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16 System. out.println(queue) ; 

17 System.out.printInC"queue.remove(): " + queue.remove()); 
18 System.out.println(Cqueue) ; 

19 } 

20 } 


The output is: 
[GB, DE, FR, ES] 
queue.element(): GB 
queue.remove(): GB 


[DE, FR, ES] 
queue.remove(): DE 
[FR, ES] 
queue.add("IE"): 
[FR, ES, IE] 
queue.remove(): FR 
[ES, IE] 


The add, element, and remove operations are illustrated at lines 4, 9, and 10, respectively. 


By comparing the output in Example 6.1 with that of Example 5.1 on page 103, it is easy to 
see that the only operational difference between a queue and a stack is the access point. With a 
queue, it is at the front, where the “oldest” element—the one that has been there the longest—is 
found. With a stack, it is at the top, where the “youngest” element—the one that arrived most 
recently—is found. 

Notice that for the ArrayDeque class, the toString () method (invoked automatically by the 
printIn() method at line 8) displays the queue from front to back, and the stack from top to 
bottom. So in both cases, the access point is at the left end of the display. 


A SIMPLE Queue INTERFACE 


The operational requirements of a queue can be formalized by this simple Java interface: 
EXAMPLE 6.2 A Queue Interface 


1 public interface Queue<E> { 
2 public void addCE element); 
3 public E elementQ); 
4 public boolean isEmpty(); 
5 public E remove(); 
6 public int sizeQ; 
7 } 
In addition to the three required queue operations, this interface also specifies an isEmpty() method 
and a size() method. 


Compare the Queue interface shown in Example 6.2 with the JCF’s Queue interface, shown in 
Figure 4.10 on page 89. It includes offerQ), peekQ), and pol1() methods. In most situations, 
these are the same operations as the add(), element(), and remove() methods, respectively. If 
a limited-capacity queue is full, the add(Q) method throws an I1legalStateException, while 
the offer() method merely returns false. If the queue is empty, the element() the remove() 
methods throw a NoSuchElementException, while the peek() and pol1l() methods merely 
returns nul1. 
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AN INDEXED IMPLEMENTATION 


Like stacks and other linear data structures, queues can be implemented using an ordinary 


array. The ArrayQueue class shown in Example 6.3 is similar to the ArrayStack class shown in 
Example 5.3 on page 104. 


EXAMPLE 6.3 An ArrayQueue Implementation 


1 
2 
3 
4 
5 
6 
7 
8 
9 


public class ArrayQueue<E> implements Queue<E> { 


private E[] elements; 

private int front; 

private int back; 

private static final int INITIAL_CAPACITY = 4; 


public ArrayQueue() { 
elements = (E[]) new Object [INITIAL_CAPACITY] ; 


} 


public ArrayQueue(int capacity) { 
elements = (E[]) new Object[capacity]; 


} 
public void add(E element) { 
if (sizeQ == elements.length - 1) { 
resizeQ); 


elements[back] = element; 
if (back < elements. length - 1) { 
++back; 
} else { 
back = 0; //wrap 
} 
} 


public E elementQ { 
if (sizeQ == 0) f{ 
throw new java.util .NoSuchElementException() ; 
} 
return elements[front] ; 


} 


public boolean isEmptyQ { 
return (sizeQ) == 0); 


} 


public E removed) { 
if (sizeQ == 0) f{ 
throw new java.util .NoSuchElementException() ; 
} 
E element = elements[front]; 
elements[front] = null; 
++front; 
if (front == back) { // queue is empty 
front = back = 0; 


} 
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48 if (front == elements.length) { // wrap 
49 front = 0; 

50 } 

51 return element; 

52 } 

53 

54 public int sizeQ { 

55 if (front <= back) { 

56 return back - front; 

57 } else { 

58 return back - front + elements. length; 
59 } 

60 } 

61 

62 private void resize() { 

63 int size = sizeQ); 

64 int len = elements. length; 

65 assert size == len; 

66 Object[] a = new Object[2*len]; 

67 System.arraycopy(elements, front, a, 0, len - front); 
68 System.arraycopy(elements, 0, a, len - front, back); 
69 elements = CE[])a; 

70 front = 0; 

71 back = size; 

72 } 

73 } 


Instead of storing the size counter, this implementation stores front and back indexes into the array. 
The front element of the queue is always at elements[front], and the back element of the queue is 
always at elements[back-1] (except when back = 0). The front index is advanced each time an 
element is removed from the queue (at line 44), and the back index is advanced each time an element is 
added (at line 21). In both cases, when the index reaches the end of the array, it is “advanced” to 0. This 
“wraps” the queue around the end of the array, like a ring, allowing array elements to be reused. 


AN INDEXED IMPLEMENTATION 


We can use a doubly linked list to implement the Queue interface the same way we imple- 
mented the Stack interface in Example 5.4 on page 106. 


EXAMPLE 6.4 A LinkedQueue Class 


1 public class LinkedQueue<E> implements Queue<E> { 

2 private Node<E> head = new Node<E>(); // dummy node 

3 private int size; 

4 

5 public void add(E element) { 

6 head.prev = head.prev.next = new Node<E>(element, head.prev, head); 
7 ++51Ze; 

8 } 

9 

10 public E elementQ { 

11 if (size == 0) f{ 

12 throw new java.util.EmptyStackException() ; 

13 

14 return head.next.element; // front of queue // next <--> prev 


15 } 
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17 public boolean isEmptyQ { 

18 return (size == 0); 

19 } 

20 

21 public E remove() { 

22 if (size == 0) { 

23 throw new java.util.EmptyStackException() ; 
24 } 

25 E element = head.next.element; // next <--> prev 
26 head.next = head.next.next; // next <--> prev 
27 head.next.prev = head; // next <--> prev 
28 --size; 

29 return element; 

30 } 

31 

32 public int sizeQ { 

33 return size; 

34 } 

35 

36 private static class Node<E> { 

37 E element; 

38 Node<E> prev; 

39 Node<E> next; 

40 

4 Node() { 

42 this.prev = this.next = this; 

43 } 

44 

45 Node(E element, Node<E> prev, Node<E> next) { 
46 this.element = element; 

47 this.prev = prev; 

48 this.next = next; 

49 } 

50 } 

51 } 


The only changes that need to be made to the LinkedStack class (other than the method names) are at 
lines 14 and 25—27, where the next and prev fields are swapped. 


APPLICATION: A CLIENT-SERVER SYSTEM 


Queues are used to implement the FIFO protocol. That is common in client-server application. 
For example, when cars on a toll road arrive at a toll plaza, the cars are the clients, and the toll 
booths are the servers. If the rate at which the cars pass through the toll booths is slower than 
their arrival rate, then a waiting-line builds up. That is a queue. 


EXAMPLE 6.5 A Client-Server Simulation 


This simulation illustrates object-oriented programming (OOP). Java objects are instantiated to repre- 
sent all the interacting clients and servers. To that end, we first define Client and Server classes. 

This is an event-driven simulation, where clients arrive for service at random times and services have 
random durations. Each client will have an arrival time, a time when service starts, and a time when it 
ends. All time values will be integers. 
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1 public class Client { 

2 private int id; 

3 private int startTime; 

4 

5 public ClientCint id, int time) { 

6 this.id = id; 

7 System.out.printf("%s arrived at time %d.%n", this, time); 
8 } 

9 

10 public void setStartTimeCint time) { 
1 startTime = time; 

12 } 

13 

14 public String toString() { 

15 return "#" + id; 

16 } 

17 } 


To trace the simulation, we have the Client constructor print its arrival time (at line 7). 

Each server serves at most one client at a time, so the Server class has a client field that references that 
server’s client, or is nul1 when the server is idle. 

Each Server object also stores the time when it will stop serving its current client. That time is 
computed by adding its service time (a positive random integer) to the time when it begins serving that 
client. The random number generator used to generate those service times is stored as a random field in 
the Server object. A server’s actual service time varies with each client. But the server’s average service 
time is a fixed property of the server, initialized when the Server object is constructed (at line 10): 

1 public class Server { 


2 private Client client; 

3 private int id; 

4 private int stopTime = -1; 

5 private double meanServiceTime; 

6 private ExpRandom random; 

7 

8 public ServerCint id, double meanServiceTime) { 
9 this.id = id; 

10 this.meanServiceTime = meanServiceTime; 

TT this.random = new ExpRandom(meanServiceTime) ; 
12 } 

13 

14 public double getMeanServiceTime() { 

15 return meanServiceTime; 

16 } 

17 

18 public int getStopTime() { 

19 return stopTime; 

20 } 

21 

22 public boolean isIdleQ) { 

23 return client == null; 

24 } 

25 

26 public void startServing(Client client, int time) { 
27 this.client = client; 

28 this.client.setStartTime(time) ; 

29 this.stopTime = time + random.nextInt(); 

30 System.out.printf("%s started serving client %s at time %d.%n", 
31 this, client, time); 
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33 


34 public void stopServing(int time) { 

35 System.out.printf("%s stopped serving client %s at time %d.%n", 
36 this, client, time); 

37 client = null; 

38 } 

39 

40 public String toString(® { 

4 return "Server " + "ABCDEFGHIJKLMNOPQRSTUVWXYZ".charAt(Cid); 

42 } 

43 } 


The startServing() method (lines 26-32) assigns a new client to the server, stores the start time in 
the Client object, computes and stores the stop time in its own stopTime field, and prints a report of 
those actions. The stopServing() method (lines 34—38) stores the stop time in the Client object and 
prints another report. 

For a simulation to be realistic, it must use randomly generated numbers to simulate the natural uncer- 
tainty of the real word. Those random numbers should have the same distribution as the natural uncertain- 
ties that they represent. Service times and time between client arrivals both tend to be distributed 
exponentially. That means that the probability that the time ¢ is less than a number x is p = 1 — e** But 
the Math. random() method returns numbers that are uniformly distributed in the range 0 < p < 1. So to 
convert the random number p to the exponentially distributed random variable x, we solve the equation, 
obtaining x = —(1/A)In(1 — p). The constant 1/A is the mean of the distribution. Thus we code the 
nextDouble() method as shown at line 9: 


1 public class ExpRandom extends java.util.Random { 
2 private double mean; 

3 

4 public ExpRandom(double mean) { 

5 this.mean = mean; 

6 } 

7 

8 public double nextDouble() { 

9 return -mean*Math.log(1 - Math.random()); 
10 } 

11 

12 public int nextIntQ) f{ 

13 return Cint)Math.ceil(CnextDouble()); 

14 } 

15 } 


The actual simulation is performed by the main class shown below. It sets four constants for the simula- 
tion at lines 2—5: the number of servers, the number of clients arriving for service, the mean service time 
among the servers, and the mean time between arrivals for the clients. 

The queue is to hold the clients that have arrived for service and are waiting for an unoccupied server. 
The simulation instantiates two random exponentially distributed number generators (lines 7—8) and 
separate arrays for the Server and Client objects (lines 9-10): 

1 public class Simulation { 
private static final int SERVERS = 3; 
private static final int CLIENTS = 20; 
private static final double MEAN_SERVICE_TIME = 25; 
private static final double MEAN_ARRIVAL_TIME = 4; 
private static Queue<Client> queue = new ArrayDeque<Client>(); 
private static ExpRandom randomService = new ExpRandom(MEAN_SERVICE_TIME) ; 
private static ExpRandom randomArrival = new ExpRandom(MEAN_ARRIVAL_TIME) ; 
private static Server[] servers = new Server[SERVERS] ; 
private static Client[] clients = new Client[CLIENTS]; 
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12 public Simulation(d) { 
13 String fmt = "%-27s %6d%n"; 
14 System.out.printf(fmt, "Number of servers:", SERVERS); 
15 System.out.printfCfmt, "Number of clients:", CLIENTS); 
16 System.out.printf(fmt, "Mean service time:", MEAN_SERVICE_TIME) ; 
17 System.out.printf(fmt, "Mean interarrival time:", MEAN_ARRIVAL_TIME) ; 
18 for Cint i=0; 1i<SERVERS; i++) { 
19 double meanServiceTime = randomService.nextDoubleQ ; 
20 servers[i] = new Server(i, meanServiceTime) ; 
21 System.out.printf("Mean service time for %s: %4.1f%n", 
22 servers[i], servers[i].getMeanServiceTime()); 
23 } 
24 int nextArrivalTime = 0; 
25 for Cint t=0, clientId=0; clientId < CLIENTS; t++) f{ 
26 if (t == nextArrivalTime) { 
27 nextArrivalTime = t + randomArrival.nextIntQ; 
28 Client client = clients[clientId] = new Client(++clientId, t); 
29 queue.add(client) ; 
30 System.out.printIn(C"\tClient queue: " + queue); 
31 } 
32 for (Server server : servers) { 
33 if (t == server.getStopTime()) { 
34 server.stopServing(t) ; 
35 } 
36 if (server.isIdleQ) && !queue.isEmpty()) { 
37 Client client = (Client) queue. remove() ; 
38 System.out.printInC"\tClient queue: " + queue); 
39 server.startServing(client,t); 
40 } 
a } 
42 } 
43 } 
44 
45 public static void main(String[] args) { 
46 new SimulationQ; 
47 } 
48 } 
The output for one run was: 
Number of servers: 3 
Number of clients: 12 
Mean service time: 25 
Mean interarrival time: 4 


Mean service time for Server A: 17.2 
Mean service time for Server B: 51.7 
Mean service time for Server C: 24.5 
#1 arrived at time 0. 
Client queue: [#1] 
Client queue: [] 
Server A started serving client #1 at time 0. 
#2 arrived at time 2. 
Client queue: [#2] 
Client queue: [] 
Server B started serving client #2 at time 2. 
#3 arrived at time 4. 
Client queue: [#3] 
Client queue: [] 
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Server C started serving client #3 at time 4. 
#4 arrived at time 6. 
Client queue: [#4] 
#5 arrived at time 7. 
Client queue: [#4, #5] 
#6 arrived at time 11. 
Client queue: [#4, #5, #6] 
Server A stopped serving client #1 at time 11. 
Client queue: [#5, #6] 
Server A started serving client #4 at time 11. 
#7 arrived at time 12. 
Client queue: [#5, #6, #7] 
#8 arrived at time 16. 
Client queue: [#5, #6, #7, #8] 
#9 arrived at time 23. 
Client queue: [#5, #6, #7, #8, #9] 
#10 arrived at time 30. 
Client queue: [#5, #6, #7, #8, #9, #10] 
Server C stopped serving client #3 at time 30. 
Client queue: [#6, #7, #8, #9, #10] 
Server C started serving client #5 at time 30. 
Server B stopped serving client #2 at time 33. 
Client queue: [#7, #8, #9, #10] 
Server B started serving client #6 at time 33. 
#11 arrived at time 34. 
Client queue: [#7, #8, #9, #10, #11] 
#12 arrived at time 36. 
Client queue: [#7, #8, #9, #10, #11, #12] 

The simulation main loop is at lines 25—42. It iterates once for each clock tick t, and continues until all 
the clients have arrived. If it is time for a new client to arrive, then lines 27—30 execute, setting the next 
arrival time, creating a new Client object, and adding the new client to the queue. Then at lines 32-41, 
each Server object is updated. If it is time for an active server to finish serving its client, then its 
stopserving() method is invoked. If a server is idle and there are clients waiting in the queue, then the 
next client in the queue is removed from the queue and that server begins serving it. 

The output shows the progress of one run. Client #4 is the first client to have to wait in the queue, 
followed by #5 and #6. At time t = 11, Server A finishes serving Client #1 and begins serving Client #4, 
who therefore leaves the queue at that time. 

By the time the next server (Server C) becomes free, at time t = 30, four more clients have arrived and 
are waiting in the queue. Then C begins to serve #5. At time t = 33, Server B finishes with #2 and begins 
serving #6. Then the queue grows back to six clients waiting when the simulation finishes at time t = 36. 


The simulation in Example 6.5 is called a time-driven simulation because its main loop 
iterates once for each tick of the clock. In contrast, an event-driven simulation is one in which the 
main loop iterates once for each event: a job arrival, a service begin, or a service end. Event- 
driven simulations are usually simpler, but they require all servers to perform at the same rate. 


Review Questions 


6.1. Why are queues called FIFO structures? 


6.2 Would it make sense to call a queue 
a. aLILO structure? 
b. a FILO structure? 
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What are the advantages and disadvantages of the linked implementation of a queue relative 
to the contiguous implementation? 


Problems 


Trace the following code, showing the contents of the queue q after each call: 
ArrayQueue q; 
q.enqueue("A") 

.enqueue(""B") ; 

.enqueue("C") 

. dequeue () ; 

. dequeue() ; 

.enqueue(""D") 

.enqueue("E"); 

.enqueue("F") 

. dequeue () ; 

.enqueue(""G"); 

. dequeue() ; 

. dequeue() ; 

. dequeue () ; 
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Write this client method using only the methods specified in the Queue interface: 
public static <E> Queue<E> reversed(Queue<E> queue) { 
// returns a new queue that contains the same elements as the given 
// queue, but in reversed order 


Write this client method using only the methods specified in the Queue interface: 
public static <E> Queue<E> reversed(Queue<E> queue) { 
// returns a new queue that contains the same elements as the given 
// queue, but in reversed order, and leaves the given queue in its 
// original state 


Write this client method using only the methods specified in the Queue interface: 
public static <E> void reverse(Queue<E> queue) { 
// reverses the elements in the specified queue 


Write this client method using only the methods specified in the Queue interface: 
public static <E> E secondElement(Queue<E> queue) { 
// returns the second element in the specified queue, leaving the 
// queue in its original state 


Write this client method using only the methods specified in the Queue interface: 
public static <E> E lastElement(Queue<E> queue) { 
// returns the last element in the specified queue, leaving the 
// queue in its original state 


Write this client method using only the methods specified in the Queue interface: 
public static <E> void removeLastElement(Queue<E> queue) { 
// removes the last element in the specified queue 


Write this client method using only the methods specified in the Queue interface: 
public static <E> Queue<E> merge(Queue<E> ql, Queue<E> q2) { 
// returns a new queue that contains the same elements as the two 
// specified queues, alternately merged together, leaving the two 
// specified queues in their original state 
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Add this member method to the ArrayQueue class shown in Example 6.3 on page 119: 
public void reverse() 
// reverses the contents of this queue 


Add this member method to the LinkedQueue class shown in Example 6.4 on page 120: 
public void reverse() 
// reverses the contents of this queue 


Add this member method to the ArrayQueue class shown in Example 6.3 on page 119: 
public E second() { 
// returns the second element of this queue 


Add this member method to the LinkedQueue class shown in Example 6.4 on page 120: 
public E second() { 
// returns the second element of this queue 


Add this member method to the ArrayQueue class shown in Example 6.3 on page 119: 
public E removeSecond() { 
// removes and returns the second element of this queue 


Add this member method to the LinkedQueue class shown in Example 6.4 on page 120: 
public E removeSecond() { 
// removes and returns the second element of this queue 


Answers to Review Questions 


Queues are called FIFO structures because the first element that is inserted into a queue is always the 
first element to be removed. FIFO is an acronym for first-in-first-out. 


a. Yes, because a LILO structure would mean last-in-last-out which is just the same as a first-in- 
first-out protocol. 

b. No, because a FILO structure would mean first-in-last-out which is the opposite of the first-in- 
first-out protocol. 


The advantage of the linked implementation is that it essentially eliminated the possibility of queue 
overflow, that is, the number of calls to the addQ) method is limited only by the amount of computer 
memory available to the new operator. The only real disadvantage is that the linked implementation 
uses pointers, so it is more complicated than the contiguous implementation. 


Solutions to Problems 


A — AB > ABC > BC > C > CD > CDE > CDEF —> DEF — DEFG > EFG > FG 4 G 
public static <E> Queue<E> reversed(Queue<E> queue) { 
// returns a new queue that contains the same elements as the given 
// queue, but in reversed order 
Queue<E> queuel = new ArrayDeque<E>(); 
Deque<E> stack = new ArrayDeque<E>Q() ; 
while(!queue.isEmpty()) { 
stack. push(queue. remove()) ; 


} 

while(!stack.isEmpty()) { 
queuel.add(stack.pop()); 

} 


return queuel; 
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6.3 public static <E> Queue<E> reversed(Queue<E> queue) { 
// returns a new queue that contains the same elements as the given 
// queue, but in reversed order, and leaves the given queue in its 
// original state 
Queue<E> queuel = new ArrayDeque<E>(); 
Deque<E> stack = new ArrayDeque<E>() ; 
for Cint i=0; i<queue.size(); i++) { 
stack.push(queue.element()) ; 
queue.add(queue. remove()); 
} 
while(!stack.isEmpty()) { 
queuel.add(stack.pop()); 
} 
return queuel; 


} 


6.4 public static <E> void reverse(Queue<E> queue) { 
// returns a new queue that contains the same elements as the given 
// queue, but in reversed order, and leaves the given queue in its 
// original state 
Deque<E> stack = new ArrayDeque<E>() ; 
while(!queue.isEmpty()) { 
stack. push(queue. remove()) ; 
} 
while(!stack.isEmpty()) { 
queue.add(stack.pop()); 
} 
} 


6.5 public static <E> E secondElement(Queue<E> queue) { 
// returns the second element in the specified queue, leaving the 
// queue in its original state 
queue.add(queue. remove()); 
E element = queue.element(); 
for Cint i=1; i<queue.size(); i++) { 
queue.add(queue. remove()) ; 
} 
return element; 


} 


6.6 public static <E> E lastElement(Queue<E> queue) { 
// returns the last element in the specified queue, leaving the 
// queue in its original state 
for Cint i=1; i<queue.size(); i++) { 
queue.add(queue. remove()); 
} 
E element = queue.element(); 
queue. add(queue.remove()); 
return element; 


} 
6.7 public static <E> void removeLastElement (Queue<E> queue) { 
// removes the last element in the specified queue 
for Cint i=1; i<queue.size(); i++) { 
queue.add(queue. remove()); 
} 
queue. remove() ; 
} 
6.8 public static <E> Queue<E> merge(Queue<E> queuel, Queue<E> queue2) { 


// returns a new queue that contains the same elements as the two 
// specified queues, alternately merged together, leaving the two 
// specified queues in their original state 
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Queue<E> queue3 = new ArrayDeque<E>(); 

int nl = queuel.size(); 

int n2 = queue2.size(); 

int n = Math.min(nl1, n2); 

for Cint i = 0; i < n; i++) f{ 
queue3.add(queuel.element()); 
queuel.add(queuel. remove()); 
queue3.add(queue2.element()); 
queue2.add(queue2.remove()) ; 

} 

for Cint i = 0; i < nl - n; i++) { 
queue3.add(queuel.element()); 
queuel.add(queuel. remove()); 

} 

for Cint i = 0; i < n2 - n; i++) { 
queue3.add(queue2.element()); 
queue2.add(queue2.remove()); 

} 

return queue3; 


i 


public void reverse() { 
// reverses the contents of this queue 
resizeQ); 
int n = sizeQ); 
for Cint i=0; i<n/2; i++) { 
E e = elements[i]; 
elements[i] = elements[n-1-i]; 
elements[n-1-i1] = e; 
} 
} 


public void reverse() { 
// reverses the contents of this queue 
Node<E> p = head; 
for Cint i=0; i<=size; i++) f{ 
Node<E> q = p.next; 
p.next = p.prev; 
p = p.prev = q; 
} 
} 


public E second() { 


// returns the second element of this queue 


if (sizeQ < 2) { 


throw new java.util .NoSuchElementException() ; 


} 
int len = elements. length; 
if (front + 1 == len) { 
return elements[0]; 
} else { 
return elements[front+1]; 
} 
} 


public E second() { 


// returns the second element of this queue 


if (size < 2) { 


throw new java.util .NoSuchElementException() ; 


} 


return head.next.next.element; 
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6.13 public E removeSecond() { 
// removes and returns the second element of this queue 
if (sizeQ < 2) { 
throw new java.util .NoSuchElementException() ; 
} 
int len = elements. length; 
Ee; 
if (front + 1 == len) { 
e = elements[0]; 
elements[0] = elements[len-1]; 
elements[len-1] = null; 
front = 0; 
} else { 
e = elements[front+1]; 
elements[front+1] = elements[front]; 
elements[front] = null; 
++front; 
} 
return e; 


} 


6.14 public E removeSecond() { 
// removes and returns the second element of this queue 
if (sizeQ < 2) { 
throw new java.util .NoSuchElementException() ; 
} 
E element = head.next.next.element; 
head. next.next = head.next.next.next; 
head.next.next.prev = head.next; 
--size; 
return e; 


Lists 


A list is a collection of elements that are accessible sequentially: the first element, followed by 
the second element, followed by the third element, and so on. This is called sequential access or 
linked access (as opposed to direct or indexed access). A good example is a line of train cars on a 
track: To get to the fourth car from the first car, you have to go through the second and third cars 
in that order. This example also illustrates how insertions and deletions are managed in a list. The 
only changes needed are to the two cars that are adjacent to location where the insertion or 
deletion is made. None of the other cars is affected. 


THE JCF List INTERFACE 


The List interface specified by the Java Collections Framework is outlined on page 85. It 
adds 10 methods to the 15 methods specified by the Collection interface that it extends. 

From the JCF inheritance hierarchy shown in Figure 4.1 on page 70, you can see that the 
Queue, Deque, and Set interfaces all extend the List interface. Consequently, all of the List, 
Queue, Deque, and Set classes implement the List interface. This includes the concrete classes 
outlined in Chapter 4: ArrayList, Vector, LinkedList, PriorityQueue, ArrayDeque, 
EnumSet, HashSet, LinkedHashSet, and TreeSet. 

As Table 4.1 on page 70 shows, the JCF provides both linked and indexed implementations of 
the List interface: the LinkedList class uses sequential access, while the ArrayList class 
provides direct access. 


EXAMPLE 7.1 Testing a List Class 


This program illustrates some of the methods specified by the List interface: 


1 public class TestStringList { 

2 public static void main(String[] args) { 

3 List<String> list = new ArrayList<String>Q ; 

4 Collections.addAl1(list, "GB", "DE", "FR", "ES"); 

5 System.out.printIn(list); 

6 list.add(3, "DE"); 

7 System.out.printIn(list); 

8 System.out.printinC"list.get(3): " + list.get(3)); 

9 System.out.printInC"list.indexOf(\"DE\"): " + list.indexOfC"DE")); 
10 System.out.printInC"list.indexOF(\"IE\"): " + list.indexOfC"IE")); 
11 System.out.printInC"list.subList(1, 5): " + list.subList(1, 5)); 
12 list.removeC("DE"); 

13 System.out.printIn(list); 

14 } 

15 } 
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The output is: 
[GB, DE, FR, ES] 
[GB, DE, FR, DE, ES] 
list.get(3): DE 
list.indexOf("DE"): 1 
list.indexOfC"IE"): -1 
list.subList(1, 5): [DE, FR, DE, ES] 
[GB, FR, DE, ES] 

The list object is created at line 3 as an ArrayList of String objects. It uses the static addA11Q 
method of the Collections class at line 4 to load four strings into it. The two-argument add() method is 
tested at line 6, inserting the string "DE" as element number 2. Note that, unlike sets, lists allow duplicate 
elements. 

At line 8, the getQ method is used to obtain element number 3. Lines 9-10 illustrate the indexOfQ 
method, returning the index number of the specified element. Note that it returns the index of the first 
occurrence of the element; DE occurs at both index | and index 3. 

The call list.subList(1, 5) at line 11 returns the sublist of elements indexed from | up to (but not 
including) 5. As long as the two indexes are in range, the size of the sublist will always be their difference; 
in this case, 5 — 1 = 4 elements. 

At line 12, the call list. remove("DE") removes the first occurrence of DE. 


THE RANGE-VIEW OPERATION sublistQ 


The sublist() method illustrated in Example 7.1 is far more powerful than what that 
example suggests. It provides a “view” into its list, against which other List methods may be 
applied. When used in the chained-invocation mode, like this 

list.sublist(1, 5).get(2) 
it allows the attached method to apply to the list itself while restricting its context to the sublist. 
Thus, that call, for example, would return element number 2 of the sublist, which is actually 
element number 3 of the list itself. 


EXAMPLE 7.2 Using the sublist() Method as a Range-View Operation 


1 public class TestSubList { 

2 public static void main(String[] args) { 

3 List<String> list = new ArrayList<String>Q; 

4 Collections.addAl1Clist, "A","B","C","D","E","F","G","H", "I", "I"); 
5 System.out.printIn(list); 

6 System.out.printInC"list.subList(3,8): " + list.subList(3,8)); 

7 System.out.printInC"list.subList(3,8).get(2): " 

8 + list.subList(3,8).get(2)); 

9 System.out.printInC"list.subList(3,8).set(2,\"B\"):");3 

10 list.subList(3,8).set(2, "B"); 

11 System.out.printIn(list); 

12 System.out.printInC"list.indexOf(\"B\"): " + list.indexOf("B")); 
13 System.out.printInC"list.subList(3,8).indexOfF(\"B\"): " 

14 + list.subList(3,8).indexOf("B")); 

15 System.out.printIn(list); 

16 System.out.printInC"Collections. reverse(list.subList(3,8)):"); 
17 Collections.reverse(list.subList(3,8)); 

18 System.out.printIn(list); 

19 System.out.printInC"Collections. rotate(list.subList(3,8), 2):"); 
20 Collections.rotate(list.subList(3,8), 2); 


21 System.out.printIn(list); 
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22 System.out.printInC"Collections. fillClist.subList(3,8), \"X\"):"); 
23 Collections. fillClist.subList(3,8), "X"); 
24 System.out.printIn(list); 
25 list.subList(3,8).clearQ; 
26 System.out.printIn(list); 
27 } 
28 } 
The output is: 


[A, B, C, D, E, F, G, H, I, J] 
list.subList(3,8): [D, E, F, G, H] 
list.subList(3,8).get(2): F 
list.subList(3,8).set(2,"B"): 
[A, B, C, D, E, B, G, H, I, J] 
list.indexOf("B"): 1 
list.subList(3,8).indexOf("B"): 2 
[A, B, C, D, E, B, G, H, I, J] 
Collections.reverse(list.subList(3,8)): 
[A, B, C, H, G, B, E, D, I, J] 
Collections.rotate(list.subList(3,8), 2): 
[A, B, C, E, D, H, G, B, I, J] 
Collections. fillClist.subList(3,8), "X"): 
[A, B, C, X, X, X, X, X, I, J] 
[A, B, C, I, J] 
The call list. subList(3, 8) at line 6 returns the sublist [D, E, F, G, H]. 
The call list. subList(3, 8) .get(2) at line 8 returns F, which is element number 2 in that sublist. 
The call list.subList(3,8).set(2,"B") at line 10 replaces that element with B in the list. 
The call list. indexOf("B") at line 12 returns 1 because the first B in the list is element number 1. 
The call list.subList(3,8).indexOf("B") at line 14 returns 2 because the first B in that sublist is 
element number 2 of the sublist. 
The call Collections. reverse(list.subList(3,8)) at line 17 reverses the five-element sublist 
[D,E,B,G,H] within the original list, changing [A,B,C,D,E,B,G,H,1I,J] to [A,B,C,H,G,B,E,D,1,J]. 
The call Collections. rotate(list.subList(3,8),2) at line 20 rotates the five-element sublist 
[H,G,B,E,D] to [E,D,H,G,B], changing the whole list to [A,B,C,E,D,H,G,B,1,J]. 
The call Collections. fi11Clist.subList(3,8),"X") at line 23 replaces the five-element sublist 
[E,D,H,G,B] to [X,X,X,X,X], changing the whole list to [A,B,C,X,X,X,X,X,1,J]. 
The call list.subList(3,8).clear() at line 25 deletes the five-element sublist [X,X,X,X,X] from 
the list, changing it to [A,B,C,1I,J]. 


LIST ITERATORS 


Collection iterators are outlined on page 77. The JCF defines the ListIterator interface as 
an extension of the Iterator interface. It specifies an additional six methods that reflect the 
bidirectional nature of a list iterator. All nine methods are shown in Figure 7.1 on page 135. 

The standard way to obtain a list iterator on a list is to invoke its listIterator() method, just 
as invoking its iterator() method returns an ordinary (unidirectional) iterator. 


EXAMPLE 7.3 Using the sublist() Method as a Range-View Operation 


public class TestSubList { 
public static void main(String[] args) { 
List<String> list = new ArrayList<String>Q; 
Collections.addAl1Clist, "A","B","C","D","E","F","G","H"); 
System.out.printIn(list); 


a fF wowN = 


134 LISTS [CHAP. 7 


6 ListIterator<String> it = list.listIteratorQ; 

7 System.out.printInC"it.nextIndexQ): " + it.nextIndex()); 
8 System.out.printInC"it.nextQ: " + it.nextQ); 

9 System.out.printInC"it.previousIndex(Q): " + it.previousIndexQ); 
10 System.out.printInC"it.nextIndexQ): " + it.nextIndex()); 
1 System.out.printInC"it.nextQ: " + it.nextQ); 

12 System.out.printInC"it.nextQ: " + it.nextQ); 

13 System.out.printInC"it.previous(): " + it.previousQ); 
14 System.out.printInC"it.previousIndex(Q): " + it.previousIndexQ); 
15 System.out.printInC"it.nextIndexQ: " + it.nextIndex()); 
16 System.out.printInC"it.previous(): " + it.previousQ); 
17 System.out.printInC"it.nextQ: " + it.nextQ); 

18 System.out.printInC"it.nextQ: " + it.nextQ); 

19 System.out.printInC"it.nextQ: " + it.nextQ); 

20 System.out.printInC"it.addQ(\"X\"):")3 

21 it.add("X"); 

22 System.out.printIn(list); 

23 System.out.printInC"it.nextQ: " + it.nextQ); 

24 System.out.printInC"it.setQ"Y\"):")3 

25 it.setC"Y"); 

26 System.out.printIn(list); 

27 System.out.printInC"it.nextQ: " + it.nextQ); 

28 System.out.printInC"it.remove():"); 

29 it.remove(); 

30 System.out.printInC"it.nextQ: " + it.nextQ); 

31 System.out.printIn(list); 

32 } 

33 } 


The output is: 

[A, B, C, D, E, F, G, H] 
it.nextIndexQ): 0 
it.nextQ: A 
it.previousIndex(): 0 
it.nextIndexQ): 1 
it.nextQ: B 

it.nextQ: C 
it.previous(): C 
it.previousIndex(Q): 1 
it.nextIndexQ): 2 
it.previous(): B 
it.nextQ: B 

it.nextQ: C 

it.nextQ: D 
it.add("X"): 

[A, B, C, D, X, E, F, G, H] 
it.nextQ: E 
it.setc"Y"): 

[A, B, C, D, X, Y, F, G, H] 
it.nextQ: F 
it.remove(): 

it.nextQ: G 

[A, B, C, D, X, Y, G, H] 

The output shows the effects of the nine ListIterator methods. At lines 7 and 10, the nextIndex() 
method returns the index number of the iterator’s current element: first 0, and then 1. Similarly, the 
previousIndex() method returns the index number of the iterator’s previous element. The next() and 
previous() methods move the iterator up and down the list. At line 21, the addQ) method inserts a new 
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Method Summary 


void add(E e) 
Tnserts the specified element into the list (optional operation). 


boolean |nhasNext () 
Returns true if this list iterator has more elements when traversing the list in the forward direction. 


boolean |hasPrevious () 
Returns true if this list tterator has more elements when traversing the list in the reverse direction. 


4 Returns the next element in the list. 


int | next Index () 
Returns the index of the element that would be returned by a subsequent call to next. 


previous (} 


Returns the previous element in the lst. 


int | previousIndex () 
Returns the index of the element that would be returned by a subsequent call to previous. 


void! remove () 
Removes from the list the last element that was returned by next or previous (optional operation). 


void|/set(E e) 
ee Replaces the last element returned by next or previous with the specified element (optional operation). 


Figure 7.1 Methods specified by the java.util.ListIterator interface 


element X immediately after that last element referenced by the next() method, which was D. At line 25, 
the set() method changes the last element referenced by the next () method, from E to Y. At line 29, the 
remove() method deletes the last element referenced by the next() method, which was F. 


Like a finger tracing through some text, an iterator is an object bound to a collection that 
moves independently of other iterators on the same collection. 


EXAMPLE 7.4 Using Several ListIterator Objects Iterating on the Same List Object 


This program illustrates some of the methods that are specific to the ArrayList class: 
public class TestingSeveralIterators { 
public static void main(String[] args) { 

List<String> list = new ArrayList<String>Q; 
Collections.addAl1Clist, "A", "B", "C", "D"); 
System.out.printIn(list); 
ListIterator<String> itl = list.listIteratorQ; 
System.out.printiInC"itl.nextQ: " + itl.nextQ); 
System.out.printinC"itl.nextQ: " + itl.nextQ); 
System.out.printInC"itl.nextQ: " + itl.nextQ); 

10 System.out.printInC"itl.add(\"X\"):")3 

1 itl.add("X"); 

12 System.out.printIn(list); 
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13 ListIterator<String> it2 = list.listIteratorQ; 
14 System.out.printInC"it2.nextQ: " + it2.nextQ); 
15 System.out.printInC"it2.nextQ: " + it2.nextQ); 
16 System.out.printInC"it2.setQ\"Y\"):")3 
17 it2.setC"Y"); 
18 System.out.printIn(list); 
19 ListIterator<String> it3 = list.listIteratorQ; 
20 System.out.printInC"it3.nextQ: " + it3.nextQ); 
21 System.out.printInC"it3.nextQ: " + it3.nextQ); 
22 System.out.printInC"it3.nextQ: " + it3.nextQ); 
23 System.out.printInC"it3.nextQ: " + it3.nextQ); 
24 System.out.printInC"itl.previousQ): " + itl.previous()); 
25 System.out.printInC"itl.previousQ(): " + itl.previous()); 
26 System.out.printInC"itl.previousQ(): " + itl.previous()); 
27 } 
28 } 
The output is: 
[A, B, C, D] 
itl.nextQ: A 
itl.nextQ): B 
itl.nextQ: C 
itl.add("X"): 


[A, B, C, X, D] 
it2.nextQ): A 
jt2.nextQ: 
jt2.setC"Y"): 
[A, Y, C, X, D] 
it3.nextQ: A 
it3.nextQ: Y 
it3.nextQ: C 
it3.nextQ: X 
itl.previous(): X 
itl.previous(): C 
itl.previousQ): Y 
The first iterator it1 advances past the first three elements and then inserts X between C and D at line 
11. The second iterator it2 advances past the first two elements and then changes B and Y at line 17. The 
third iterator it3 advances past the first four elements, including the changed element Y and the inserted 
element X. Finally, the first iterator it1 backs up over the previous three elements, including the inserted 
element X and the changed element Y. 


B 


OTHER LIST TYPES 
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The JCF defines a concrete LinkedList class. (See page 86.) But that may not be exactly 
what you need for certain applications. In those cases, the AbstractList class can be extended 
to obtain a custom-made list class that is consistent with the JCF. 


EXAMPLE 7.5 A Ring Class 


This defines a list class that uses a circular, singly linked list. It is similar to the LinkedList class, 
except that the next() method is able to wrap around from the end of the list to the beginning, thus 
forming a circle or “ring.” 


1 public class Ring<E> extends AbstractList<E> implements List<E> { 


2 private Node<E> end; 
3 private int size; 
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public boolean add(E element) { 
if (size == 0) { 
end = new Node<E>(element, null); 
end.next = end; 


} else { 
end = end.next = new Node<E>(element, end.next); 
} 
++51Ze; 
return true; 
} 
public E getCint index) { 
if Cindex < 0 || index >= size) f{ 
throw new IndexOutOfBoundsException() ; 
} 


Node<E> p = end.next; 

for Cint i=0; i<index; i++) { 
p = p.next; 

} 

return p.element; 


} 


public Iterator<E> iterator() { 
return new RingIterator(); 


} 


public String toString( { 
Node<E> p = end.next; 
StringBuilder buf = new StringBuilder("[" + p.element) ; 
while (p != end) { 
p = p.next; 
buf.append(", 
} 
buf.append("]"); 
return buf.toString( ; 


} 


+ p.element) ; 


public int sizeQ { 
return size; 


} 


private class RingIterator implements Iterator<E> { 
private Node<E> last; 
private Node<E> preLast = end; 


public boolean hasNext() { 
return size > 0; 


} 


public E nextQ) { 
if Clast == null) f{ 
last = preLast.next; 
} else { 
preLast = last; 
last = last.next; 


z 
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61 return last.element; 

62 } 

63 

64 public void remove() { 

65 if Clast == null) f{ 

66 throw new I1legalStateException() ; 
67 } 

68 if (size == 1) { 

69 end = preLast = null; 

70 t+ else { 

71 preLast.next = last.next; 
72 } 

73 if Clast == end) { 

74 end = preLast; 

75 } 

76 last = null; 

77 --size; 

78 } 

79 } 

80 

81 private static class Node<E> { 
82 E element; 

83 Node<E> next; 

84 

85 Node(E element, Node<E> next) { 
86 this.element = element; 

87 this.next = next; 

88 } 

89 } 

90 } 


The class defines nine members: the two fields: end and size; the five methods: add(Q), getQ, 
jteratorQ, sizeQ), and toStringQ); and the two classes: RingIterator and Node. The 
RingIterator class extends the Iterator class to support the iterator() method. Instances of the 
Node class are used to hold the list’s data and to maintain their links. 

The AbstractList class, which this Ring class extends, requires only its get() and size() methods 
to be overridden. For our specific purposes, we also override the add(), iterator(), and toString() 
methods. We plan to use this class to solve the Josephus problem (in Example 7.6), and for that we'll need 
to add elements to the ring, iterate around the ring, and print the ring. 

The end field references the node that references the “beginning” node in the list, that is, it points to the 
beginning of the list. Since we are using only one link per node (a “singly linked list’), we have to have 
direct access to the predecessor of any node that might have to be deleted. The end reference is null when 
the list is empty. 

The size field keeps count of how many elements are in the list. Initially 0, it is incremented in the 
add() method (line 12), decremented in the iterator’s remove() method (line 77), and returned in the 
size() method (line 43). 

The add() method invokes the Node class’s constructor (lines 7 and 10) to create a new node for the 
specified element. If the list is empty, it sets its end field to a single node that points to itself. Otherwise, 
the new node is inserted immediately after the end node, and then the new reference is set to that new 
node. This way, new elements are always added to the “end” of the ring. (See Figure 7.2.) 

The get() method is required by the AbstractList class. It uses a reference pointer p at line 20 to 
count the specified number of nodes to return access to the element with the specified index number. 

The iterator() method uses the RingIterator constructor at line 28 to return an iterator on the 
ring. The RingIterator class has two fields, both Node references. The last field points to the last node 
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Figure 7.2 The end reference in the Ring class 


accessed by the iterator’s next() method. It will be nu11 initially and immediately after each call to the 
iterator’s remove() method. The preLast field points to the node that points to the last node. It is used 
by the remove() method. 

The iterator’s hasNext() method returns true at line 51, unless the list is empty. In a circular list, every 
element has a next element. 

The next() method serves two purposes. If called immediately after a call to remove(), then it resets 
the last field (line 56), which the remove() method leaves nu11. Otherwise, it simply advances the 
preLast and last pointers (lines 58-59). 

The purpose of the remove() method at line 64 is to delete the last element accessed by the next() 
method. Normally, it does that simply by resetting one link—the preceding node’s next field (line 71). 
But it also has several special cases to handle. If its invocation does not immediately follow a call to 
next(), then it’s in an illegal state, and thus throws an I1legalStateException at line 66. If the list has 
only one element, then removing it should leave the list in its original empty state by nullifying its 
preLast field (line 69). In that case, the Ring class’s end field is also nullified. If the element being 
deleted is the one referenced by the Ring class’s end field, then that field is reset to the element’s prede- 
cessor at line 74. Finally, last is nullified at line 76 to mark the fact that the next() method was not the 
last one called in the iterator. 
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Figure 7.3 The last and preLast references in the RingIterator class 


Figure 7.3 illustrates how the last and preLast pointers work in an iterator on a Ring list. This shows 
the state of the list immediately after a call to next has returned the element C. An immediate call to 
remove() would delete the C node by resetting the B node’s next reference to point to the D node, as 
shown in Figure 7.4. 
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Figure 7.4 After a call to it. remove() 


Note the efficiency of this operation: To delete C, only one link has to be reset, and one link is nullified. 
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APPLICATION: THE JOSEPHUS PROBLEM 


This problem is based upon a report by the historian Joseph ben Matthias (Josephus) on the 
outcome of a suicide pact that he had made between himself and 40 soldiers as they were 
besieged by superior Roman forces in 67 A.D. Josephus proposed that each man slay his neigh- 
bor. This scheme necessarily leaves one to kill himself. Josephus cleverly contrived to be that 
one, thus surviving to tell the tale. 

The solution to the problem is generated by the Josephus program in Example 7.6. It uses the 
Ring class from Example 7.5 on page 136. 


EXAMPLE 7.6 The Josephus Problem 


1 public class Josephus { 

2 public static final int SOLDIERS = 8; 

3 public static final String ALPHA = "ABCDEFGHIJKLMNOPQRSTUVWXYZ" ; 
4 

5 public static void main(String[] args) { 

6 Ring<String> ring = new Ring<String>() ; 

7 for Cint i=0; i<SOLDIERS; i++) { 

8 ring.add(ALPHA.substring(i, i+1)); 

9 } 

10 System.out.printInCring) ; 

TT Iterator<String> it = ring.iteratorQ; 

12 String killer = it.nextQ; 

13 while Cring.size() > 1) { 

14 String victim = it.nextQ; 

15 System.out.printIn(Ckiller + " killed " + victim); 
16 it.remove(); 

17 killer = it.nextQ; 

18 } 

19 System.out.printInC"The lone survivor is " + it.nextQ)); 
20 } 

21 } 


Here is the output from a run for 11 soldiers: 
[A, B, C, D, E, F, G, H, I, J, K] 
A killed 
killed 
killed 
killed 
killed 
killed 
killed 
killed 
killed 
killed 

The lone survivor is G 

This output shows the solution, which is illustrated 
in Figure 7.5. Figure 7.5 The solution to the Josephus problem 
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The Ring list is instantiated at line 6 and loaded at 
lines 7-9. The iterator it is obtained from the iterator() method at line 11. After advancing past A at 
line 12, it advances past B at line 14, removes B at line 16, and then advances past C at line 17. The while 
loop continues until only one soldier remains. Each iteration advances it past two elements, naming them 
killer and victim, and removes the victim node. 
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APPLICATION: A Polynomial CLASS 


A polynomial is a mathematical function of the form: 


ml y an t...ta, jx ta, 


P(X) = agx” + a,x 
The greatest exponent, n, is called the degree of the polynomial. For example, p(x) = Ix4 -2isa 
polynomial of degree 4. The simplest polynomials are constant polynomials such as p(x) = 6 
(degree 0) and linear polynomials such as p(x) = 9x + 6 (degree 1). The unique zero polynomial 
P(x) = 0 is defined to have degree —1. In this section we present a Polynomial class whose 
instances represent mathematical polynomials and which supports the usual algebraic operations 
on polynomials. 

A polynomial can be regarded as a sum of distinct terms. A term is a mathematical function of 
the form ¢(x) = cx, where c is any real number and ¢ is any nonnegative integer. The number c is 
called the coefficient, and the number e is called the exponent. 

To define a class whose objects represent polynomials, we use a linked list of Term objects. 
For example, the polynomial p(x) = 3x7 — 2x + 5 could be represented as a list of three elements, 
where the first element represents the term 3x2, the second element represents the term — 2x, and 
the third element represents the (constant) term 5. 


EXAMPLE 7.7 A Polynomial Class 


public class Polynomial { 


1 

2 private List<Term> list = new LinkedList<Term>Q(); 
3 public static final Polynomial ZERO = new Polynomial(); 
4 

5 private Polynomial() { // default constructor 
6 } 

7 

8 public Polynomial(double coef, int exp) { 

9 if Ccoef != 0.0) f{ 

10 list.add(new Term(coef, exp)); 

11 

12 } 

13 

14 public Polynomial(Polynomial p) { // copy constructor 
15 for (Term term : p.list) { 

16 this. list.add(new Term(term)); 

17 } 

18 } 

19 

20 public Polynomial(double... a) f{ 

21 for Cint i=0; i<a.length; i++) { 

22 if cali] != 0.0) f{ 

23 list.add(new Term(a[i], 1)); 

24 } 

25 } 

26 } 

27 

28 public int degree() { 

29 if Clist.isEmptyQ) { 

30 return -1; 

31 } else { 

32 return list.get(list.sizeQ)-1).exp; 

33 } 


34 } 
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public boolean isZero() { 


} 


return list.isEmpty(); 


public Polynomial plus(Polynomial p) { 


} 


if (this.isZeroQ) { 

return new Polynomial (p); 
} 
if (p.isZeroQ) { 

return new Polynomial (this); 
} 
Polynomial q = new Polynomial(); 
ListIterator<Term> it = list. listIterator(; 
ListIterator<Term> itp = p.list.listIteratorQ; 
while (it.hasNextQ && itp.hasNextQ)) { 

Term term = it.nextQ; 

Term pTerm = itp.nextQ); 

if (term.exp < pTerm.exp) { 
q.list.add(new Term(term)) ; 
itp.previous(); 
else if (term.exp == pTerm.exp) { 
q.list.add(new Term(term.coef + pTerm.coef, term.exp)); 
else { // (term.exp > pTerm.exp) 
q.list.add(new Term(pTerm)) ; 
it.previous(); 


} 


ir 
while (it.hasNext()) { 
q-list.add(new Term(it.next())); 


ww 


} 

while Citp.hasNext()) { 
q-list.add(new Term(itp.next())); 

Hs 


return q; 


public String toString() { 


if (this.isZeroQ) { 
return "0"; 
} 
Iterator<Term> it = list.iterator(; 
StringBuilder buf = new StringBuilder(Q; 
boolean isFirstTerm = true; 
while Cit.hasNextQ) { 
Term term = it.nextQ; 
double c = term.coef; 
int e = term.exp; 
if CisFirstTerm) { 
buf.append(String. format("%.2f", c)); 
isFirstTerm = false; 
t+ else { 
if Cterm.coef < 0) { 
buf.append(String.format(" - %.2f", -c)); 
t+ else { 
buf.append(String.format(" + %.2f", c)); 
} 
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92 } 

93 if (e = 1) { 

94 buf.append("x"); 

95 } else if (e > 1) f{ 

96 buf.append("xA" + e); 

97 } 

98 } 

99 return buf.toStringQ ; 

100 } 

101 

102 private static class Term { 

103 private double coef; 

104 private int exp; 

105 

106 public Term(double coef, int exp) { 
107 if (coef == 0.0 || exp < 0) { 

108 throw new I1legalArgumentException() ; 
109 } 

110 this.coef = coef; 

111 this.exp = exp; 

112 } 

113 

114 public Term(Term that) { // copy constructor 
115 this(that.coef, that.exp); 

116 } 

117 } 

118 } 


Instead of using inheritance by extending a List class, this Polynomial class uses composition, 
declaring a list field at line 2. This design gives the user more control over the class by limiting it to only 
those methods actually defined in the class. Of course, those methods are mostly implemented by means 
of List methods. The 1ist backing structure is declared to be a LinkedList<Term> collection. So each 
element of the list is a Term object. The list stores the polynomial’s nonzero terms in increasing order of 
their exponents. 

The Term class is defined as an inner class at lines 102—117, that is, a static member class. It has two 
fields, coef and exp (for the coefficient and the exponent of the term), and two constructors. The second 
constructor (line 114) is a copy constructor, creating a duplicate of the term passed to it. It uses the this 
keyword to invoke the two-argument constructor defined at line 106. 

The Polynomial class has four constructors and four methods. The default constructor (also called the 
“no-arg constructor”) defined at line 8 is declared to be private. This prevents it from being used outside 
of its class. Its purpose is to construct the Polynomial object that represents the zero polynomial, and it is 
invoked at line 3 to do that. To ensure that that object is unique, we prevent it from being constructed 
anywhere else. 

The constructor at line 8 creates a Polynomial object that represents a single term, such as 88.844. 
The constructor at line 14 is a copy constructor, which duplicates the object passed to it. The constructor at 
line 20 uses the Java “var-args” syntax to allow a variable number of arguments, in this case of type 
double. This is the same as a single argument of type double[]. That constructor creates a polynomial 
whose coefficients are the nonzero values in the array, each one generating a term of the form c,,x”, where 
c, = a[n]. For example, the array {4, 0, 7, 0, 0, 0, 3} would produce the Polynomial object that repre- 
sents 4 + 7x* + 3x°. 

The degree() method at line 28 returns the polynomial’s highest exponent. Since the terms are 
maintained in the list in increasing order of their exponents, the degree of the polynomial is the simply the 
exp field of the last element in the list. That element has index 1ist.size()-1, so the expression at line 
32 does the job. 
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The plusQ method at line 40 returns a new object that represents the sum of the implicit argument 
(this) and the explicit argument (p). That result, instantiated as q at line 44, is built by means of the three 
while loops ate lines 50-68. The first loop repeatedly compares the exponents of a term from each 
polynomial (this and p), duplicates the one with the smaller exponent, and adds it to q. If the two terms 
are equal, then their coefficients are added (at line 57) to form the new term. The loops use iterators to 
traverse the two lists. Since the term whose coefficient was not yet used has to be accessed again on the 
next iteration, its iterator has to be backed up (at lines 55 and 60). Consequently, we need the bidirectional 
iterators provided by the listIterator() method. 

The toString() method uses a unidirectional iterator to traverse the list at line 80 to generate a string 
representation of the Polynomial object. 


EXAMPLE 7.8 Testing the Polynomial Class 


1 public class TestPolynomial { 

2 public static void main(String[] args) { 

3 Polynomial p = new Polynomial(3, -8, 0, 0, 2, 1); 
4 Polynomial q = new Polynomial(O, 5, 6, 9); 

5 System.out.printInC"p: " + p); 

6 System.out.printInC"p.degree(): " + p.degree()); 
7 System.out.printInC"q: " + q); 

8 System.out.printIn("q.degree(): " + q.degreeQ)); 
9 System.out.printInC"p.plus(q): " + p.plus(q)); 
10 System.out.printInC"q.plus(p): " + q.plus(p)); 

1 System.out.printInC"p.plus(q).degree(Q): " + p.plus(q).degree()); 
12 Polynomial z = new Polynomial (0); 

13 System.out.printIn("z: " + z); 

14 System.out.printIn("z.degree(): " + z.degreeQ)); 
15 System.out.printIn("p.plus(z): " + p.plus(z)); 
16 System.out.printIn("z.plus(p): " + z.plus(p)); 
17 System.out.printInC"p: "+ p)5 

18 Polynomial t = new Polynomial(8.88, 44); 

19 System.out.printInC"t: " + t); 

20 System.out.printInC"t.degree(): " + t.degreeQ); 
21 } 

22 } 


The output is: 
p: 3.00 - 8.00x + 2.00xA4 + 1.00xA5 
p.degree(): 5 


1 
-plus(z): 3.00 - 8.00x + 2.00xA4 + 1.00xA5 
-plus(p): 3.00 - 8.00x + 2.00xA4 + 1.00xA5 

3.00 - 8.00x + 2.00xA4 + 1.00xA5 


q: 5.00x + 6.00xA2 + 9.00xA3 

q.degree(): 3 

p.plus(q): 3.00 - 3.00x + 6.00xA2 + 9.00xA3 + 2.00xA4 + 1.00xA5 
q.plus(p): 3.00 - 3.00x + 6.00xA2 + 9.00xA3 + 2.00xA4 + 1.00xA5 
p.plus(q).degreeQ: 5 

z: 0 

z.degreeQ): - 

p 

Zz 

p: 

t 


: 8.88xA44 
t.degree(Q): 44 
The var-args constructor is tested at lines 3-4, and the two-argument constructor is tested at line 18. 
The other testing includes checking (at lines 9-10) that the plus() method is commutative: p + q=q +p, 
and (at lines 14-16) that the zero polynomial z satisfies the defining condition p + z=z+p=p. 
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Review Questions 


What is the difference between the Collection interface and the List interface? 


What is the difference between the AbstractCollection class and the AbstractList 
class? 


What is the difference between the AbstractList class and the AbstractSequentialList 
class? 


What is the difference between the Iterator interface and the ListIterator interface? 
What is the difference between the ArrayList class and the LinkedList class? 
What is the difference between an ArrayList object and a Vector object? 


In deciding whether to use an ArrayList or a LinkedList in an application, what factors 
make one choice better than the other? 


Problems 


Implement the following method: 
public static void loadRandomLetters(LinkedList list, int n) 
// fills list with n randomly generated capital letters 


Write a method that uses an iterator to print the contents of a linked list, one object per line. 


Write a method that uses an iterator to print the contents of a linked list in reverse order, one 
object per line. 


Write the following method: 
public static void exchange(LinkedList list, int i, int j) 
// swaps the elements indexed at i and j 


Modify the solution to the Josephus Problem (Example 7.6 on page 140) so that it also uses a 
SKIP parameter to generate the output. The value of SKIP is a constant nonnegative integer 
that specifies whom each soldier should kill. For example, if skip = 2, then A would kill D 
(skipping over B and C), E would kill H, and so forth. The original solution is then the spe- 
cial case where skip = 0. Assume that no one commits suicide. So if a killer’s target turns 
out to be himself, he would kill the next man in the list. 


Write and test this method for the Polynomial class: 
public double valueAt(double x) 
// returns the y-value of p at the specified x-value 


Write and test this method for the Polynomial class: 
public Polynomial times(double factor) 
// returns a new polynomial that is equal to this polynomial 


Answers to Review Questions 


The List interface includes these 10 methods that work with indexes: 


public void add(int index, Object object); 

public boolean addAl1Cint index, Collection collection); 
public Object getCint index); 

public int indexOf (Object object); 


public int lastIndexOf (Object object); 
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public ListIterator listIterator(); 
public ListIterator listIterator(Cint index); 


public Object cemeve CME IOHey 
public Object set(int index, Object object); 
public List subList(int start, int stop); 


The AbstractList class implements the methods of the List interface, including the 10 index meth- 
ods listed above in the answer to Question 7.1 which are not in the AbstractCol lection class. 


The AbstractSequentialList class is designed to serve as a base class for linked list classes. It 
specifies the two abstract methods listIterator() and size() which must be implemented by any 
concrete subclass. 


The ListIterator class extends the Iterator class in a way that is analogous to the way the 
AbstractSequentialList class extends the AbstractList class. (See the answer to Question 7.2 
above.) Ordinary Iterator objects are unidirectional iterators that iterate on array lists; 
ListIterator objects are bidirectional iterators that iterate on linked lists. 


Instances of the ArrayList class use contiguous, indexed, direct access (array) storage. Instances of 
the LinkedList class use linked (sequential) access storage. So array lists provide faster access, while 
linked lists provide faster modifications (insertions and removals). 


There isn’t much difference between an ArrayList object and a Vector object: They both provide 
direct indexed access. As part of the Java Collections Framework, the ArrayList class was intro- 
duced more recently, in Java 1.2, and so it is probably more preferred. The Vector class has about 
twice as many methods, but many are redundant and consequently a bit confusing. 


An ArrayList object should be preferred when frequent lookups are expected. A LinkedList object 
should be preferred when frequent additions and/or removals are expected. (See the answer to Ques- 
tion 7.6 above.) 


Solutions to Problems 


void loadRandomLetters( LinkedList list, int n) { 
list.clearQ; 
while (O < n--) 
list.add("" + Cchar)C'A' + Cint) (Math. random() *26))); 
} 
void printForward(LinkedList list) { 
for (ListIterator itr=list.listIterator(); itr.hasNextQ); ) 
System.out.printIn(itr.nextQ); 
} 
void printBackward(LinkedList list) { 
ListIterator itr=list.listIterator(list.sizeQ); 
while (itr.hasPrevious()) 
System.out.printInCitr.previous()); 


c 


void exchange(LinkedList list, int i, int j) { 
Object ithObj = list.get(i); 
Object jthObj = list.get(j) ; 
list.setCi,jthObj); 
list.set(j,ithObj) ; 

} 


The solution to the generalized Josephus Problem: 
public class Josephus { 
public static final int SOLDIERS = 11; 
public static final int SKIP = 2; 
public static final String ALPHA = "ABCDEFGHIJKLMNOPQRSTUVWXYZ" ; 
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public static void main(String[] args) { 
Ring<String> ring = new Ring<String>() ; 
for Cint i=0; i<SOLDIERS; i++) { 
ring.add(ALPHA.substring(i, i+1)); 
} 
System.out.printInCring) ; 
Iterator<String> it = ring.iterator(); 
String killer = it.nextQ; 
String victim = null; 
while Cring.size(Q) > 1) { 
for Cint i = 0; i <= SKIP; i++) { 
victim = it.nextQ); 
} 
if (victim == killer) { 
victim = it.nextQ); 
} 
System.out.printInCkiller + " killed " + victim); 
it.remove(); 
killer = it.nextQ; 


} 
System.out.printInC"The lone survivor is " + it.nextQ); 
} 
} 
7.6 public double valueAt(double x) { 


// returns the y-value p(x) of p at the specified x-value 
// This implements Horner's method. 
if (this.isZeroQ) { 
return 0.0; 
} 
ListIterator<Term> it = list. listIteratorQ; 
Term term = null; 
while (Cit.hasNextQ)) { // move to last element 
term = it.nextQ); 
} 
term = it.previous(); 
double y = term.coef; 
int n = term.exp; 
while (Cit.hasPrevious()) { 
term = it.previous(); 
y *= Math.pow(x, n - term.exp); 
y += term.coef; 
n = term.exp; 


} 
y *= Math.pow(x, n); 
return y; 
} 
del: public Polynomial times(double factor) { 


// returns a new polynomial that is equal to this polynomial 
if (this.isZeroQ || factor == 0.0) { 
return ZERO; 
} 
Polynomial p = new Polynomial (this); 
for (Term t : p.list) { 
t.coef *= factor; 
} 


return p; 


CHAPTER 8 


Hash Tables 


A hash table (also called a map, a lookup table, an associative array, or a dictionary) 1s a 
container that allows direct access by any index type. It works like an array or vector except that 
the index variable need not be an integer. A good analogy is a dictionary; the index variable is the 
word being looked up, and the element that it indexes is its dictionary definition. 

A table is a sequence of pairs. The first component of the pair is called the key. It serves as the 
index into the table, generalizing the subscript integer used in arrays. The second component is 
called the value of its key component. It contains the information being looked up. In the dictio- 
nary example, the key is the word being looked up, and the value is that word’s definition (and 
everything else listed for that word). 

A table is also called a map 
because we think of the keys 


being mapped into their values, Object 

like a mathematical function: |_abstractMap.---------- Mab 

fikey) = value. Tables are also |— EnumMap 

called an associative arrays | HashMap 

because they can be imple- LinkedHashMap SortedMap 

: \— TreeMap. --------------- Navigab1leMap 


mented using two parallel arrays; 


the keys in one array and the Figure 8.1 The Map classes of the JCF 
values in the other. 


THE JAVA Map INTERFACE 


The Java Collections Framework includes a Map interface, as shown in Figure 8.1. It is defined 
in the java.util package like this: 
public interface Map { 
int sizeQ); 
boolean isEmptyQ(); 
boolean containsKey(Object key); 
boolean containsValue(Object value); 
Object get(Object key); 
Object put(Object key, Object value); 
Object remove(Object key); 
void putAll1(Map map) ; 
void clear(); 
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public Set keySet(); 
public Collection values(); 
public Set entrySetQ); 
public interface Entry { 
Object getKey(); 
Object getValueQ; 
Object setValue(Object value); 
boolean equals(Object 0); 
int hashCodeQ) ; 


} 


boolean equals(Object 0); 
int hashCode(); 


} 


THE HashMap CLASS 
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As the class hierarchy in Figure 8.1 shows, Java defines four implementations of its Map inter- 
face: the AbstractMap class, the HashMap class, the TreeMap class, and the WeakHashMap class. 


EXAMPLE 8.1 A German-English Dictionary 


This program uses the HashMap class to build a German-English dictionary: 


1 

2 

3 

4 

5 

6 

7 

8 

9 
10 System.out. 
11 System. out. 
12 System.out. 
13 System.out. 
14 System.out. 
15 System.out. 
16 System.out. 
17 System. out 
18 System.out. 
19 } 
20 } 

The output is: 


map.putC("Tor","gate") ; 


printInC"map= 
printInC"map. 
printInC"map. 
printInC"map. 
printInC"map. 
printInC"map. 
printInC"map. 


.printlnC"map= 


printInC"map. 


public class TestDictionary { 
public static void main(String[] args) { 
Map map = new HashMap(); 
map.put("Tag","day"); 
map.putC"Hut", "hat") ; 
map.putC"Uhr","clock"); 
map.put("Rad", "wheel"); 
map.putC"Ohr","ear"); 


+ map); 

sizeQ=" + map.size()); 

keySetQ=" + map.keySetQ); 

values(Q)=" + map.values()); 
get(\"Uhr\")="_ + map.getC"Uhr")); 
remove(\"Rad\")="_+ map.remove("Rad")); 
get(\"Rad\")="_+ map.getC"Rad")); 

"4+ map); 

sizeQ=" + map.size()); 


map={Rad=wheel, Uhr=clock, Ohr=ear, Tor=gate, Hut=hat, Tag=day} 


map.size()=6 


map.keySet(Q)=[Rad, Uhr, Ohr, Tor, Hut, Tag] 
map.values()=[wheel, clock, ear, gate, hat, day] 
map.getC"Uhr")=clock 
map. remove("Rad")=wheel 
map.get("Rad")=nul1 
map={Uhr=clock, Ohr=ear, Tor=gate, Hut=hat, Tag=day} 


map.size()=5 


The put() method inserts key/value pairs into the table. For example, 
map.put("Tag", "day"); 


inserts the key/value pair ("Tag", "day"), where "Tag" is the key and "day" is the value. 
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The first call to printinQ invokes the HashMap. toString() method, printing the entire Map object. 
The second call to printInQ invokes the HashMap.size() method, showing that the Map object has six 
key/value elements. The next call to printInQ invokes the HashMap. keySet() method, which returns a 
Set object containing all the keys (the six German words). The next call to printInQ© invokes the 
HashMap.values() method, which returns a Collection object containing all the values (the six 
English words). The next call to printInQ invokes the HashMap.get() method, which returns the value 
for a given key. This call returns the value object "clock" for the key object "Uhr". The next call to 
printInQ invokes the HashMap. remove() method, which deletes the ("Rad", "wheel") pair, which is 
confirmed by the next call: map.get("Rad") returns nul11, indicating that there is no key/value pair in 
map whose key is "Rad". The last two lines prints the entire map again and its size, revealing that the 
C"Rad", "wheel") pair has indeed been deleted. 


The order in which the key/value pairs were stored in the HashMap object in Example 8.1 
seems to be random and unrelated to the order in which the pairs were inserted. The next 
example verifies this. 


EXAMPLE 8.2 Java HashMap Objects Are Hash Tables 


This program creates two independent HashMap objects and loads them with the same key/value pairs 
but in different orders: 

1 public class TestHashTable { 

2 public static void main(String[] args) { 

3 Map map1 = new HashMap(); 

4 mapl.putC("Tor","gate") ; 

5 map1.put("Rad", wheel"); 

6 mapl.put("Tag","day") ; 

7 map1.putC"Uhr", "clock"); 

8 map1.putC"Hut", "hat") ; 

9 map1.putC"Ohr","ear") ; 


10 System.out.printInC"mapl="_ + map1); 
1 Map map2 = new HashMap(); 

12 map2.put("Rad", wheel"); 

13 map2.putC"Uhr", "clock"); 

14 map2.putC"Ohr","ear") ; 

15 map2.put("Tag","day"); 

16 map2.putC("Tor","gate") ; 

17 map2.putC"Hut", "hat") ; 

18 System.out.printInC"map2="_ + map2); 
19 } 


The output is: 
mapl={Rad=wheel, Uhr=clock, Ohr=ear, Tor=gate, Hut=hat, Tag=day} 
map2={Rad=wheel, Uhr=clock, Ohr=ear, Tor=gate, Hut=hat, Tag=day} 
The order in which the key/value pairs are stored in the HashMap table is reflected by the output from 
the toString() method. That stored order is same in both tables, independent of the order in which they 
were inserted. Note that it is also the same stored order in the HashMap table in Example 8.1. 


JAVA HASH CODES 


The order in which the key/value pairs are stored in a HashMap table depends only upon the 
capacity of the table and the values of objects’ the hash codes. Recall (Chapter 4) that every 
object in Java is given an intrinsic hash code, which is computed from the actual hard data stored 
in the object. The Object .hashCode() method returns that code for each object. 
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EXAMPLE 8.3 Hash Codes of Some String Objects 


This program prints the intrinsic hash codes of the String objects stored in the previous programs: 
public class TestHashCodes { 
public static void main(String[] args) { 


1 

2 

3 printHashCode("Rad") ; 
4 printHashCode("Uhr") ; 
5 printHashCode("Ohr") ; 
6 printHashCode("Tor"); 
7 printHashCode("Hut") ; 
8 printHashCode("Tag") ; 
9 


} 
10 
11 private static void printHashCode(String word) { 
12 System.out.printf("%s: %s%n", word, word.hashCode(Q)); 
13 } 
14 } 
The output is: 
Rad: 81909 
Uhr: 85023 
Ohr: 79257 
Tor: 84279 
Hut: 72935 
Tag: 83834 


The fact that all six codes are relatively close 5-digits integers reflects the fact that these String 


objects all have length 3. 


HASH TABLES 


A hash table is a table that uses a special function to compute the location of data values from 
their key values instead of storing the keys in the table. The special function is called the hash 
function for the table. Since the lookup time is independent of the size of the table, hash tables 


have very fast access time. 

Java defined a Hashtable class in its java.util package. But it has 
essentially been upgraded to the HashMap class. That is, a HashMap table 
can do everything that a Hashtable object can do. Moreover, the 3| Rad 
HashMap class is more consistent with rest of the Java Collections Frame- ;| Hut 


work. 


A general hash table looks like the one pictured in Figure 8.2: an array 
of Objects indexed by their hash values. This requires that the range of 
the hash function match the range of index values in the array. This is 


Figure 8.2 Hash table 


almost always managed by simply using the remainder operator modulo 
the size of the array: 


EXAMPLE 8.4 Mapping Keys into a Hash Table of Size 11 


This program prints hash code values for String objects to be stored in a hash table of size 11: 


BB wWwNnN = 


public class TestHashing { 
private static final int MASK = Ox7FFFFFFF; // = 2A32-1 
private static final int CAPACITY = 11; 
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5 public static void main(String[] args) { 
6 printHash("Rad") ; 
7 printHashC"Uhr") ; 
8 printHashC"Ohr") ; 
9 printHashC("Tor"); 


10 printHashC"Hut") ; 
"1 printHash("Tag") ; 
12 } 
13 
14 private static void printHash(String word) { 
15 System.out.printInC"hashC" + word + ") = " + hash(word)); 
16 } 
17 
18 private static int hash(Object object) { 
19 return Cobject.hashCodeQ) & MASK) % CAPACITY; 
20 } 
21 } 
The output is: 
hash(Rad) = 3 
hash(Uhr) = 4 
hash(Ohr) = 2 
hash(Tor) = 8 
hash(Hut) = 5 
hash(Tag) = 3 


The hash function values are computed at line 19, where CAPACITY is 11 and MASK is 2147483647, 
expressed in hexadecimal as Ox7FFFFFFF. The operation n & MASK simply removes the sign from whatever 
integer n has. This is the right thing to do in Java before using the remainder operator to compute an array 
index because (unlike C++) Java may give a negative result tom % CAPACITY if mis negative. So the 
resulting value returned by the hash() function in this example is guaranteed to be in the range of 0 to 
10. 

The first five strings hash into index values 3, 4, 2, 8, and 5, so they would be stored in those locations 
in the hash table. But the sixth string ("Tag") also hashes to 3, causing a collision with "Rad", which 
would already be stored in component 3. The most common algorithm to apply when such collisions 
occur is to simply put the new item in the next available component. That would be component 6 in this 
example, since "Uhr" would already be in component 4 and "Hut" would already be in component 5. 
This “collision resolution” algorithm is called /inear probing. 


The HashMap class uses a hash function just like the one in Example 8.4 to implement its 
accessor methods: containsKey(), getQ, put(Q), remove(), and entrySet(). Its sets the 
hash table size at 101 initially. With that knowledge, we can see why the six strings in the previ- 
ous examples were stored in the order indicated. 


EXAMPLE 8.5 Mapping Keys into a Hash Table of Size 101 


This program is identical to the one in Example 8.4 except that the hash table CAPACITY is 101 instead 
of 11: 
public class TestHashing { 
private static final int MASK = Ox7FFFFFFF; // = 2A32-1 
private static final int CAPACITY = 101; 


printHash("Rad") ; 
printHashC"Uhr") ; 


1 
2 
3 
4 
5 public static void main(String[] args) { 
6 
7 
8 printHashC"Ohr") ; 
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9 printHash("Tor"); 
10 printHashC("Hut") ; 
11 printHash("Tag") ; 
12 } 
13 
14 private static void printHash(String word) { 
15 System.out.printInC"hash(" + word + ") = " + hash(word)); 
16 } 
17 
18 private static int hash(Object object) { 
19 return (object.hashCode() & MASK) % CAPACITY; 
20 } 
21 } 
The output is: 


hash(Rad) = 99 
hash(Uhr) = 82 
hash(Ohr) = 73 
hash(Tor) = 45 
hash(Hut) = 13 
hash(Tag) = 4 
The result is that the items are stored in reverse order from which they are accessed. 


HASH TABLE PERFORMANCE 


A hash table of size 101 that contain six elements will perform very well. It is very unlikely to 
have any collisions, so the access functions are immediate, running in constant time O(1). This is 
direct access, just like an array. 

But a hash table of size 101 that contains 100 elements is likely to perform very poorly 
because there will have been many collisions in the process of storing its elements. For example, 
if the string "Lob" had 60 collisions before a free component was found for it, then each time it 
is accessed, it will take 60 “probes” to find it. That kind of performance is close to O(7) —not 
much better than a linked list. 

The solution to the problem described here is to prevent the hash table from becoming too full. 
This is done by resizing it whenever it reaches a threshold size. 

The measure of fullness depends upon two parameters: The size of the hash table is the actual 
number of elements in the table; the capacity of the table is the number of components that it has. 
The ratio of these two parameters is called the /oad factor. In the first example cited in this 
section, the size was 6 and the capacity was 101, resulting in a load factor of 6/101 = 5.94 
percent. In the second example, the size was 100, resulting in a load factor of 100/101 = 99.01 
percent. 

The HashMap class automatically resizes its hash table when the load factor reaches a specific 
threshold value. This threshold value can be set when the hash table is created, using the 
constructor 

public HashMapCint initialCapacity, float loadFactor) 
which also allows the initial capacity to be set. If you use a constructor that does not take one or 
the other of these two arguments, then the default values of capacity 101 and load threshold 75 
percent will be used. 
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COLLISION RESOLUTION ALGORITHMS 


The collision resolution algorithm used in the previous examples is called linear probing. 
When a new item hashes to a table component that is already in use, the algorithm specifies to 
increment the index until an empty component is found. This may require a “wraparound” back 
to the beginning of the hash table. 


EXAMPLE 8.6 Linear Probing 


This program extends the program in Example 8.4. It keeps track of which table components are used 


and the load factor after each hashing. 
public class Ex1406 


1 

2 public class TestLinearProbing { 

3 private static final int MASK = Ox7FFFFFFF; // 2A32-1 
4 private static final int CAPACITY = 11; 

5 private static int size = 0; 

6 private static boolean[] used = new boolean[CAPACITY] ; 
rs 
8 
9 


public static void main(String[] args) { 


printHash("Rad") ; 


10 printHashC"Uhr") ; 
"1 printHashC"Ohr") ; 
12 printHash("Tor"); 
13 printHashC"Hut") ; 
14 printHash("Tag") ; 
15 printHash("Eis"); 
16 printHashC"Ast"); 
17 printHash("Zug") ; 
18 printHashC"Hof") ; 
19 printHash("Mal"); 
20 } 
21 
22 private static void printHash(String word) { 
23 System.out.printfC"hash(%s) = %d, 
24 word, hash(word), 100*size/CAPACITY) ; 
25 } 
26 
27 private static int hash(Object object) { 
28 ++51Ze; 
29 int h = Cobject.hashCode() & MASK) % CAPACITY; 
30 while Cused[h]) { 
31 System.out.printfC"%d, ", h); 
32 h = Ch+1)%CAPACITY; 
33 } 
34 used[h] = true; 
35 return h; 
36 } 
37 } 
The output is: 
hash(Rad) = 3, load = 9% 
hash(Uhr) = 4, load = 18% 
hash(Ohr) = 2, load = 27% 
hash(Tor) = 8, load = 36% 
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hash(Hut) = 5, load = 45% 


3, 4, 5, hash(Tag) = 6, load = 54% 


load = %d%%%n" , 
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5, 6, hashCEis) = 7, load = 63% 
3, 4, 5, 6, 7, 8, hash(Ast) = 9, load = 72% 
9, hash(Zug) = 10, load = 81% 
3, 4, 5, 6, 7, 8, 9, 10, hashCHof) = 0, load = 90% 
, 3, 4, 5, 6, 7, 8, 9, 10, 0, hash(Mal) = 1, load = 100% 

The size field contains the number of items hashed into the table. The used[] array flags which 
components are occupied in the table. The printHash() method prints the hash table index and the 
resulting load factor as a percent. When linear probing kicks in, each successive index number probe is 
printed. 

As seen in Example 8.4, the collision occurs with the insertion of "Tag" at line 14. This program shows 
that it had three collisions (at index numbers 3, 4, and 5) before finding a free hash location at index 6. 
After that insertion, the table is 54 percent full. 

Every item after that also collides. And of course, as the table fills up, the number of collisions 
becomes more frequent. The last item, "Mal", has 10 collisions. That means that thereafter, every time 
this item is accessed it will have to search every one of the 11 items before it is found; clearly an O(n) 
process. 

Notice the index “wraparound” on the insertion of "Mal": 2, 3, 4, 5, 6, 7, 8, 9, 10, 0, 1. 


Another collision resolution algorithm that usually performs better than linear probing is 
called quadratic probing. This algorithm jumps over items in its probing, with the result that the 
used components are more uniformly distributed with fewer large clusters. That improves perfor- 
mance because the resulting probe chains are shorter. 


EXAMPLE 8.7 Quadratic Probing 


This program is the same as the program in Example 8.6 except for the modified hash() function 
shown here. 


38 public class TestQuadraticProbing { 

39 private static final int MASK = Ox7FFFFFFF; // 2A32-1 
40 private static final int CAPACITY = 11; 

a private static int size = 0; 

42 private static boolean[] used = new boolean[CAPACITY] ; 
43 

44 public static void main(String[] args) { 

45 printHash("Rad") ; 

46 printHashC("Uhr") ; 

47 printHashC"Ohr") ; 

48 printHash("Tor"); 

49 printHashC"Hut") ; 

50 printHash("Tag") ; 

51 printHash("Eis"); 

52 printHashC"Ast"); 

53 printHash("Zug") ; 

54 printHashC("Hof") ; 

55 printHash("Mal"); 

56 } 

57 

58 private static void printHash(String word) { 

59 System.out.printfC"hash(%s) = %d, load = %d%%%n", 
60 word, hash(word), 100*size/CAPACITY) ; 

61 } 

62 

63 private static int hash(Object object) { 


64 ++51Ze; 
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65 int h = (Cobject.hashCode() & MASK) % CAPACITY; 
66 if Cused[h]) { 
67 int hO = h; 
68 int jump = 1; 
69 while Cused[h]) { 
70 System.out.printfC"%d, ", h); 
71 h = ChO + jump*jump)%CAPACITY; // squared increment 
72 ++jump; 
73 } 
74 
75 used[h] = true; 
76 return h; 
77 } 
78 } 
The output is: 
hash(Rad) = 3, load = 9% 
hash(Uhr) = 4, load = 18% 
hash(Ohr) = 2, load = 27% 
hash(Tor) = 8, load = 36% 
hash(Hut) = 5, load = 45% 


3, 4, hash(Tag) = 7, load = 54% 

5, hashCEis) = 6, load = 63% 

3, 4, 7, hash(Ast) = 1, load = 72% 
hash(Zug) = 9, load = 81% 

The essential difference here is in the sequence of index numbers probed within the while loop when a 
collision occurs. Instead of searching linearly, it uses a squared increment. For example, when the inser- 
tion of "Ast" collides at index 3, linear probing continued probing at indexes 4, 5, 6, 7, 8, and 9 (in 
Example 8.6). But with quadratic probing, only indexes 3, 4, 7, and 1 (= 12 mod 11) are probed, using 
successive jumps of 1, 4, and 9 (17, 27, and 37). Linear probing required 50 percent more probes. 


The price that the quadratic probing algorithm pays for its improved performance is that it is 
more likely to result in an infinite loop. That happens in Example 8.7 with the next insertion. The 
string "Hof" hashes initially to index 3. After eight collisions, the linear probing algorithm found 
a free cell at index 0 (= 11 mod 11). But the probe sequence used on this item by the quadratic 
probing algorithm is the same as for "Ast": 3, 4, 7, 1, 8, 6, 6, 8 1, 7, 4,3, 4,.... This is 
computed from the unmodulated quadratic sequence 3, 4, 7, 12, 19, 28, 39, 52, 67, 84, 103, 124, 
147, .... This continues indefinitely, probing only the six indexes 3, 4, 7, 1, 8, and 6, all of 
which have already been used. So even though the table is only 81% full, the insertion fails. That 
can’t happen with linear probing. 


SEPARATE CHAINING 


Instead of devising a more effective collision resolution algorithm, we can avoid collisions 
altogether by allowing more than one item per table component. This method is called separate 
chaining, because is uses linked lists (“chains”) to hold the multiple items. In this context, the 
table components are usually called “buckets.” 


EXAMPLE 8.8 Separate Chaining 


Here is how part of a definition for a HashTab1e class might look, using separate chaining: 
1 public class HashTable { 
2 private static final int MASK = Ox7FFFFFFF; // 2A32-1 
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3 private static int capacity = 101; 

4 private static int size = 0; 

5 private static float load = 0.75F; 

6 private static LinkedList[] buckets; 
v4 

8 HashTable() { 

9 buckets = new LinkedList[capacity]; 
10 for Cint i = 0; i < capacity; i++) f{ 
1 buckets[i] = new LinkedListQ; 

12 } 

13 } 

14 

15 HashTableCint capacity, float load) { 
16 thisQ; 

17 this.capacity = capacity; 

18 this. load = load; 

19 } 

20 

21 Object put(Object key, Object value) { 
22 int h = hash(key); 

23 LinkedList bucket=buckets [h] ; 

24 Object oldValue = null; 

25 for (ListIterator it = bucket.iteratorQ); it.hasNextQ; ) f{ 
26 Map.Entry entry = it.nextQ); 

27 if Centry.getKey().equals(key)) { 
28 break; 

29 } 

30 } 

31 if Centry.getKey().equals(key)) { 
32 oldValue = entry.setValue(value) ; 
33 } else { 

34 bucket.add(new Entry(key,value)); 
35 } 

36 return oldValue; 

37 } 

38 

39 // more methods... 

40 } 


Note that put() serves two different purposes. If the table already has an entry with the given key, it 
only changes the value of that entry. Otherwise, it adds a new entry with that key/value pair. 


The java.util .HashMap class uses separate chaining in a way that is similar to that shown in 
Example 8.8. 


APPLICATIONS 


Tables are widely used in systems programming. Moreover, they are the primary building 
blocks of relational databases. 
Here is an example in applications programming. 


EXAMPLE 8.9 A Concordance 


A concordance is a list of words that appear in a text document along with the numbers of the lines on 
which the words appear. It is just like an index of a book except that it lists line numbers instead of page 
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numbers. Concordances are useful for analyzing documents to find word frequencies and associations that 
are not evident from reading the document directly. 

This program builds a concordance for a text file. The run here uses this particular text taken from 
Shakespeare’s play Julius Caesar. The first part of the resulting concordance is shown on the right. 

This output results from obtaining a Set “view” of the concordance and then iterating through the set, 
printing one element per line. Each element is a Map. Entry object whose Key is the word from the text 
(in all uppercase letters) and whose Value is the listing of line numbers where the word was found. For 
example, the word “man” was found on lines 10, 15, 22, and 27. Line 10 is 

For Brutus is an honourable man; 

Here is the Concordance class: 


1 public class Concordance { 

2 private Map<String,String> map = new HashMap<String,String>Q() ; 
3 

4 public Concordance(String file) { 

5 int lineNumber = 0; 

6 try { 

7 Scanner input = new Scanner(new File(file)); 

8 while Cinput.hasNextLine()) { 

9 String line = input.nextLineQ; 

10 ++]lineNumber; 

1 StringTokenizer parser = new StringTokenizer(line, ",.;:Q-!?' "); 
12 while (parser.hasMoreTokens()) { 

13 String word = parser.nextToken() .toUpperCase() ; 
14 String listing = map.get(word) ; 

15 if Clisting == null) f{ 

16 listing = "" + lineNumber; 

17 } else { 

18 listing += ", " + lineNumber; 

19 } 

20 map.put(word, listing); 

21 } 

22 } 

23 input.close(); 

24 } catchCIOException e) { 

25 System.out.printIn(e); 

26 } 

27 } 

28 

29 public void write(String file) { 

30 try { 

31 PrintWriter output = new PrintWriter(file); 

32 for (Map.Entry<String,String> entry : map.entrySetQ)) { 
33 output.printIn(Centry) ; 

34 } 

35 output.closeQ; 

36 } catch(IOException e) { 

37 System.out.println(e); 

38 } 

39 } 

40 } 


The hash table is defined at line 2. It has type java.util.HashMap<String, String>, which means 
that both its key and value fields have type String. Its constructor at line 4 takes the name of the input file 
as its argument. It uses a java.util .Scanner object to read the file line by line. Each line is parsed by a 
java.util.StringTokenizer object named parser, defined at line 11. Note that the parser uses as 
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Shakespeare .txt Shakespeare.out 
Friends, Romans, c , lend me your ears! STUFF=20 
I come to bury 3 yt to praise him. 
The evil that men do lives af them, 
The good is oft with their bo 
So let it be with Ca 
Hath told yo 
If it were so, it 
And grievously hath 
Here, under leave of Brutus and the rest, -- 
For Brutus is an honourable man; 
So are they all, all honourable men. MOURN=31 
Come I to speak in Caesar's funeral. FRIENDS=1 
He was my friend, faithful and just to me. 
But Brutus says he was ambitious; 
And Brutus is an honourable man. ! : 
He hath brought : \ he CROWN=24 
Whose ransoms did the general coff Ss fai: SHOULD=20 
Did this in Caesar seem ambitious: INTERRED 
When that the poor have cried, Caesar hath wept; =19 
Ambition should be made of sterner stuff. 
Yet Brutus s he was ambitious; FRIEND=1 
And Brutus an honourable man. BUT=14, 
You all di ee that on the Lupercal BRUTUS= 
I thrice presented him with a kingly cro 0 
Which he did thrice refuse: w 
Yet Brutus says he was ambitious; 
And, sure, an honourable man. 
I speak not to di ove what Brutus 
But here I am to speak what I do kn 

did love him once, not without 


c 


se withholds yo then, to mourn for him? 
ment! thou ar fled to brutish beasts, 
And men have lost their reason! 


Figure 8.3. The concordance in Example 8.9 


delimiters all of the 11 characters in the string ", .;:()-!?'". Each parsed word is used as a key in the 
hash table. The corresponding value is the string of line numbers that is accumulated at lines 16 and 18. 
The write() method at line 29 uses a for-each loop to print the concordance to the specified file. At 
line 32, the map’s entrySet() method returns a set of elements of type Map. Entry<String, String>. 
These are the key/value pairs that are stored in the hash table. Each key is a word from the input file and 
its entry is the list of the line numbers of the lines where that word appears in the text. 
Here is the test program: 


1 public class TestConcordance { 

2 public static final String PATH = "B:\\DSWJ2\\src\\ch08\\ex09\\"; 
3 public static final String IN_FILE = "Shakespeare. txt"; 

4 public static final String OUT_FILE = "Shakespeare.out"; 

5 

6 public static void main(String[] args) { 

7 Concordance c = new Concordance(PATH+IN_FILE) ; 

8 c.writeC(PATH+OUT_FILE) ; 

9 } 

10 } 


The output from the program in Example 8.9 demonstrates a critical feature of hash tables: 
Their contents are not ordered. To obtain an alphabetized printout of the concordance, we would 
have to sort it. 


THE TreeMap CLASS 


The TreeMap class extends the AbstractMap class and implements the SortedMap interface. 
It is called a tree map because its backing structure is a binary search tree instead of a hash table. 


160 


HASH TABLES [CHAP. 8 


Shakespeare.out 

ep 2 

AFTER=3 

Mtl Skills ish S10 

AM=29 

AMBITION=20, 25 
AMBITIOUS=6, 14, 18, 21, 26 


AN=10, 15, 22, 27 


ANSWER=8 
ARE=11 
ART=32 
BE=5, 20 
BEASTS=32 
BONES=4 
BROUGHT=16 
BRUTISH=32 


COME=2, 12 
COUNTRYMEN=1 

CRIED=19 

CROWN=24 

D=8 

DID=17, 18; 23, 25, 30 
DISPROVE=28 

DO=3, 29 

EARS=1 

EVIL=3 


Figure 8.4 The ordered concordance in Example 8.10 


But it is still a map with key/value entries. As a binary search tree structure, it sacrifices its O(1) 
access time, but its keys are ordered. 


EXAMPLE 8.10 An Ordered Concordance 


By replacing HashMap with TreeMap at line 2 in the Concordance class of Example 8.9, we obtain an 
ordered concordance: 


2 


private Map<String,String> map = new HashMap<String,String>Q() ; 


Part of the output from the same test program is shown in Figure 8.4. 


8.1 
8.2 
8.3 
8.4 
8.5 
8.6 
8.7 
8.8 


8.9 


Review Questions 


What is the difference between a table and a vector? 

Why is a table also called a map? 

Why is a table also called an associative array? 

Why is a table also called a dictionary? 

What is a concordance? 

What is a hash table? 

What is the difference between the Java Hashtab1e class and the Java HashMap class? 


The first two examples showed that the order of insertion into a hash table is irrelevant if 
there are no collisions. What if there are? 


What are the advantages and disadvantages of quadratic probing compared to linear probing? 
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8.10 What are the advantages and disadvantages of using a HashMap compared to a TreeMap? 


8.1 


8.2 


8.3 


8.4 


8.1 


8.2 


8.3 


8.4 


8.5 


Problems 


Run a program similar to the one in Example 8.1 on page 149 to insert the following 16 
entries into the German-English dictionary: 

map.putC"Ast","gate"); 

map.put("Eis","ice"); 

map.putC"Hof","court, yard, farm"); 


map.putC"Hut","hat") ; Words txt 
map.putC"Lob","praise") ; = 
map.putC"Mal","mark, signal"); AFTER 
map.put("Mut", "courage") ; ae 
map.putC"Ohr","ear"); AN 
map.put("Ost","east"); ae 
map.putC"Rad", "wheel") ; BE 
map.putC"Rat","advice, counsel"); aa 
map.put("Tag", "day"); COME 
map.put("Tor","gate"); ee 
map.putC"Uhr","clock"); FOR 
map.putC"Wal", whale") ; ioe 
map.put("Zug","procession, train"); HE 
Modify the Concordance class so that it filters out common words Te 
(pronouns, adverbs, etc.) whose listing would not contribute to new ae 
insights into the document. Store the common words in a separate oy 
file like the one shown in Figure 8.5. IT 
LET 
Modify the program in Example 8.1 on page 149 so that it stores the face 
words in alphabetical order. Have it load the same data as in Problem OF 
8.1 above and then print the table’s contents in alphabetical order. a 


Implement a FrequencyTable class for producing a list of words 


together with their frequency of occurrence in a specified text file. Ligure Ss ands 


Answers to Review Questions 


A vector provides direct access to its elements by means of its integer index. A table provides direct 
access to its elements by means of a key field, which can be of any ordinal type: int, double, string, 
and so forth. 


A table is also called a map because, like a mathematical function, it maps each key value into a 
unique element. 


A table is also called an associative array because it acts like an array (see Answer 8.1) in which each 
key value is associated with its unique element. Like a mathematical function, it maps each key value 
into a unique element. 


A table is also called a dictionary because it is used the same way as an ordinary natural language dic- 
tionary: to look up elements, as one would look up words in a dictionary. 


A concordance is a list of words that appear in a text document along with the numbers of the lines on 
which the words appear. (See page 157.) 
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8.6 


8.7 


8.8 
8.9 


8.10 


8.1 


8.2 
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A hash table is a table that uses a special function to compute the location of data values from their 
key values instead of storing the keys in the table. (See page 151.) 


Not much. The Java Hashtable class has generally been superseded by the Java HashMap class, 
which conforms a little better to the Java Collections Framework. 


If there are collisions, then the order of insertion is relevant. 


Quadratic probing generally results in fewer collisions because the probes jump over gaps in the index 
range. But unlike linear probing, quadratic probing can cause infinite loops even when the table is not 
full. 


A HashMap object is a hash table implemented with separate chaining and a default load threshold of 
75 percent, so it provides nearly O(1) access time for insertions, deletions, and searches. A TreeMap 
object is a balanced binary search tree implemented as a red-black tree, so it provides nearly O(lgn) 
access time for insertions, deletions, and searches. So a HashMap is faster, but a TreeMap is ordered. 


Solutions to Problems 


Inserting 16 entries into the German-English dictionary: 
public class Testing { 
public static void main(String[] args) { 

Map map = new HashMap(11) ; 
map.putC"Ast","branch") ; 
map.putC"Eis","ice"); 
map.putC"Hof","court, yard, farm"); 
map.putC"Hut","hat") ; 
map.putC"Lob","praise"); 
map.putC"Mal","mark, signal"); 
map.putC"Mut", "courage") ; 
map.putC"Ohr","ear"); 
map.putC"Ost","east"); 
map.putC"Rad", "wheel") ; 
map.putC"Rat","advice, counsel"); 
map.putC"Tag","day"); 
map.putC"Tor","gate"); 
map.putC"Uhr","clock") ; 
map.putC"Wal","whale") ; 
map.put("Zug","procession, train"); 
System.out.printInC"map="_+ map); 
System.out.printInC"map.keySetQ=" + map.keySetQ)); 
System.out.printInC"map.sizeQ=" + map.sizeQ); 


z 


" 


i 


A Concordance that filters out common words: 
public class Concordance { 
private Map<String,String> map = new TreeMap<String,String>(); 
private Set<String> words = new TreeSet<String>(); 


public Concordance(String source, String filter) { 
int lineNumber = 0; 
try { 
Scanner input = new Scanner(new File(filter)); 
while Cinput.hasNextLine()) { 
String line = input.nextLine(); 
StringTokenizer parser = new StringTokenizer(line) ; 
words.add(parser.nextToken() .toUpperCase()) ; 
} 
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input = new Scanner(new File(source)); 
while Cinput.hasNextLine()) { 
String line = input.nextLine(); 
++]ineNumber; 
StringTokenizer parser = new StringTokenizer(line,",.3;:Q-!?' "); 
while (parser.hasMoreTokens()) { 
String word = parser.nextToken().toUpperCase() ; 
if (words.contains(word)) { 
continue; 
} 
// insert lines 14-40 from Example 8.9 


8.3. A sorted German-English dictionary: 
public class TestDictionary { 

private static Map map; 

public static void main(String[] args) { 
map = new TreeMap(); 
load); 
dump () ; 

} 


private static void loadQ) { 
map.putC"Ast","branch"); 
map.putC"Eis","ice"); 
map.putC"Hof","court, yard, farm"); 
map.putC"Hut","hat") ; 
map.putC"Lob","praise"); 
map.putC"Mal","mark, signal"); 
map.putC"Mut", "courage") ; 
map.putC"Ohr","ear"); 
map.putC"Ost","east"); 
map.putC"Rad", "wheel") ; 
map.putC"Rat","advice, counsel"); 
map.putC"Tag","day"); 
map.putC"Tor","gate"); 
map.putC"Uhr", "clock") ; 
map.putC"Wal","whale") ; 


map.put("Zug","procession, train"); 


} 


private static void dumpQ) { 
Set set = map.entrySet(); 
for (Map.Entry<String,String> entry : map.entrySet(Q)) { 
System.out.printIn(Centry) ; 
} 
} 
} 


8.4 A frequency table: 
public class FrequencyTable { 
private Map<String,String> map = new TreeMap<String,String>(); 


public FrequencyTable(String file) { 
int lineNumber = 0; 
try { 
Scanner input = new Scanner(new File(file)); 
while Cinput.hasNextLine()) { 
String line = input.nextLine(); 
++]lineNumber; 
StringTokenizer parser = new StringTokenizer(line,",.3;:Q-!?' "); 
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while (parser.hasMoreTokens()) { 
String word = parser.nextToken().toUpperCaseQ() ; 
String frequency = (String)map.get(word) ; 
if (frequency==null) { 
frequency = "1"; 


} else { 
int n=Integer.parseInt(frequency) ; 
+4+n; 
frequency = "" +n; 

} 


map.put(word, frequency); 
// insert lines 21-40 from Example 8.9 


CHAPTER 9 


Recursion 


A recursive function is one that calls itself. This powerful technique produces repetition 
without using loops (e.g., whi le loops or for loops). Thus it can produce substantial results from 
very little code. Recursion allows elegantly simple solutions to difficult problems. But it can also 
be misused, producing inefficient code. Recursive code is usually produced from recursive 
algorithms. 


SIMPLE RECURSIVE FUNCTIONS 


EXAMPLE 9.1 The Factorial Function 


The factorial function is defined mathematically by 


0 1 
a | 1, ifn =0 
. 2 2 
n(n—-1)!,ifn>0 
3 6 
This is a recursive definition because the factorial “recurs” on the right side of the 4 24 
equation. The function is defined in terms of itself. 5 120 
The first 10 values of the factorial function are shown in Table 9.1. The first é 
value, 0!, is defined by the upper half of the definition: 0! = 1 (for n = 0). All the 
rest of the values are defined by the lower half of the definition: 7 
For n= 1, 1! =n! =n(n—1)!=101-1)! = 100)! = 10) = 1. 8 
For n = 2, 2! =n! =n(n—1)! = 2(2—1)! = 2(1)! = 21) = 2. 9 
For n = 3, 3! =n! = n(n—1)! = 3(3—-1)! = 3(2)! = 32) = 6. 
For n= 4, 4! =n! =n(n—1)! = 4(4-1)! = 4(3)! = 4(6) = 24. 
For n=5, 5! =n! = n(n—1)! = 5(5—1)! = 5(4)! = 5(24) = 120. 
Notice how rapidly this function grows. 


720 
5,040 
40,310 
362,880 


Table 9.1 Factorials 


EXAMPLE 9.2 Recursive Implementation of the Factorial Function 


When a function is defined recursively, its implementation is usually a direct translation of its recursive 
definition. The two parts of the recursive definition of the factorial function translate directly into two 
Java statements: 


1 public static int fCint n) { 
2 if (n==0) { 

3 return 1; // basis 
4 } 
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5 return n*f(n-1); // recursive part 
6 } 


Here is a simple test driver for the factorial method: 


1 public static void main(String[] args) { 

2 for Cint n=0; n<10; n++) { 

3 System.out.printInC"fC"+n+") = "+f(n)); 
4 } 

5 } 


It prints the same values as shown in Table 9.1. 


EXAMPLE 9.3 Iterative Implementation of the Factorial Function 


The factorial function is also easy to implement iteratively: 


1 public static int fCint n) { 

2 int f = 1; 

3 for Cint i = 2; i <= n; i++) f{ 
4 ee 

5 } 

6 return f; 


7 } 
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Note that the function header is identical to that used in Example 9.2; only the body is different. This 
allows us to use the same test driver for both implementations. The output should be the same. 


BASIS AND RECURSIVE PARTS 


To work correctly, every recursive function must have a basis and a recursive part. The basis 
is what stops the recursion. The recursive part is where the function calls itself. 


EXAMPLE 9.4 The Basis and Recursive Parts of the Factorial Function 


In the Java method that implements the factorial function in Example 
9.2, the basis and the recursive parts are labeled with comments. The 
recursive part invokes the method, passing a smaller value of n. So start- 
ing with a positive value like 5, the values on the successive invocations 
will be 4, 3, 2, 1, and 0. When 0 is passed, the basis executes, thereby 
stopping the recursion and beginning the chain of returns, returning 1, 1, 
2, 6, 24, and finally 120. 


EXAMPLE 9.5 The Fibonacci Numbers 


The Fibonacci numbers are 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, .... Each 
number after the second is the sum of the two preceding numbers. This is 
a naturally recursive definition: 


0,ifn=0 
F,=) l,ifn=1 
Ey DE yg os | 

The first 15 values of the Fibonacci sequence are shown in Table 9.2. 
The first two values, Fy and F\, are defined by the first two parts of the 
definition: F) = 0 (for n = 0) and F,, = 1 (for 7 = 1). These two parts form 
the basis of the recursion. All the other values are defined by the recursive 
part of the definition: 


n F, 
0 0 
1 1 
2 1 
3 2 
4 3 
3) 5 
6 8 
af 13 
8 21 
9 34 

10 55 
11 89 
12 144 
13 233 
14 377 


Table 9.2 Fibonacci numbers 


CHAP. 9] RECURSION 167 


For n= 2, F,=F,,=F, 


n- 


Forn=3,F,=F,=F, 


Forn=4, F,=F,=F, 
Forn=5,F;=F,,=F 


n—- 


ai = haat hop = ht ioa 10a L 
i og heyatiogs ole th alla 2, 
17 Sg at ha is ho 23; 
jhe = heat ha lat i= 32 = 5. 
Forn=6, Fy =F,=F, 4 a 
1 2 


Forn=7, FF, =F,=F, 


n 


SMe Ppa = lst iy=ot3=8. 
= Baya tt gig Pet ls = 8 tO = 13. 


EXAMPLE 9.6 Recursive Implementation of the Fibonacci Function 


1 public static int fibCint n) { 

2 if (n < 2) { 

3 return n; // basis 

4 } 

5 return fib(n-1) + fib(n-2); // recursive part 
6 } 


Here is a simple test driver for the Fibonacci method: 


1 public static void main(String[] args) { 

2 for Cint n = 0; n < 16; n++) f{ 

3 System.out.printInC"fibC" + n+") =" + fib(n)); 
4 } 

5 } 


It prints the same values as shown in Table 9.2. 
TRACING A RECURSIVE CALL 


Hand tracing the execution of a method usually helps clarify it. 
EXAMPLE 9.7 Tracing the Recursive Factorial Function 


Here is a trace of the call (5) to the recursive factorial function defined in Example 9.2: 


main() f(5) f(4) F(3) f(2) fd) 
n{5 5 eins 4 en 4 en | 3 || | 2 | RM 
120 24 6 2 1 


Figure 9.1 Tracing the recursive factorial function 


The call originates in the main() function, passing 5 to the FQ) function. There, the value of the parame- 
ter nis 5, so it calls f (4), passing 4 to the f() function. There the value of the parameter n is 4, so it calls 
(3), passing 3 to the FQ) function. This process continues (recursively) until the call f(1) is made 
from within the call f (2). There, the value of the parameter n is 1, so it returns 1 immediately, without 
making any more calls. Then the call £(2) returns 2*1 = 2 to the call £(3). Then the call f (3) returns 
3*2 = 6 to the call £(4). Then the call £(4) returns 4*6 = 24 to the call £(5). Finally, the call f (5) 
returns the value 120 to main(). 


The trace in Example 9.7 shows that the call f(n) to the recursive implementation of the 
factorial function will generate n—1 recursive calls. This is clearly very inefficient compared to 
the iterative implementation shown in Example 9.3. 


EXAMPLE 9.8 Tracing the Recursive Fibonacci Function 


The Fibonacci function (Example 9.6) is more heavily recursive than the factorial function (Example 
9.2) because it includes two recursive calls. The consequences can be seen from the trace of the call 
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fib(5), shown in Figure 9.2 on page 168. The call originates in the main() function, passing 5 to the 
fibQ) function. There, the value of the parameter n is 5, so it calls fib (4) and fib(3), passing 4 and 3, 
respectively. Each of these calls then makes two more recursive calls, continuing down to the basis calls 
f(1) and f (0). Each of these basis calls returns 1. The recursive calls then return the sum of the two 
values returned to them, ultimately resulting in the value 8 being returned to main(). 


main() : Fib(5) 


n}5 nls 
4 3 
fib(4) 5 3 fib(3) 
n|4 n}| 3 
3 2 2 1 
fib(3) fib(2) Fib(2) ; fib(1) 
2 1 
n| 3 3 é n| 2 n} 2 ee, nf} 
A 
2 4 1 1 0 1 4 0 
fib(2) fib(1) fib(1) Fib(0) fib(1) : Fib(0) 
2 1 al 1 1: 1 
n} 2 ry nf} 1 n} 1 _y n}|0O n} 1 Ey n}|0O 
1 0 
fib(1) fib(0) 
Vy \ 
n{1 Py n|0 


Figure 9.2 Tracing the recursive Fibonacci function 


THE RECURSIVE BINARY SEARCH 


The nonrecursive binary search algorithm is given on page 31. It uses the divide-and-conquer 
strategy, each time splitting the sequence in half and continuing the search on one half. This is 
naturally recursive. 


EXAMPLE 9.9 The Recursive Binary Search 


Here is the recursive binary search algorithm: 
(Precondition: s = {so, 5), . . ., S,} 1S a sorted sequence of 7 ordinal values of the same type as x.) 
(Postcondition: either the index i is returned where s; = x, or —1 is returned.) 
1. Ifthe sequence is empty, return —1. 
2. Lets, be the middle element of the sequence. 
3. Ifs;=x, return its index i. 
4. Ifs,;<x, apply the algorithm on the subsequence that lies above s;. 
5. Apply the algorithm on the subsequence of s that lies below s;. 
It is implemented in Example 9.10. 


The recursive binary search runs in O(lgn) time. The running time is proportional to the 
number of recursive calls made. Each call processes a subsequence that is half as long as the 
previous one. So the number of recursive calls is the same as the number of times that 7 can be 
divided in two, namely lgn. 
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EXAMPLE 9.10 Testing the Recursive Binary Search 


1 public class TestBinarySearch { 
2 public static void main(String[] args) { 
3 int{] a = {22, 33, 44, 55, 66, 77, 88, 99}; 
4 print(a); 
5 System.out.printInC"search(a, 44): " + search(a, 44)); 
6 System.out.printInC"search(a, 50): " + search(a, 50)); 
7 System.out.printInC"search(a, 77): " + search(a, 77)); 
8 System.out.printInC"search(a, 100): " + search(a, 100)); 
9 } 
10 
1 public static void printCint[] a) f{ 
12 System.out.printfC"{%d", a[0]); 
13 for Cint i = 1; 1 < a.length; i++) { 
14 System.out.printfC", %d", a[i]); 
15 
16 System.out.printIn("}"); 
17 } 
18 
19 public static int search(int[] a, int x) { 
20 return search(a, 0, a.length-1, x); 
21 
22 
23 public static int searchCint[] a, int lo, int hi, int x) { 
24 // PRECONDITION: afO] <= a[1] <= ... <= a[a.length-1]; 
25 // POSTCONDITIONS: returns i; 
26 // if i >= 0, then ali] == x; otherwise i == -1; 
27 if Clo > hi) { 
28 return -1; // basis 
29 } 
30 int i = Clo + hi)/2; 
31 if (ali] == x) { 
32 return 1; 
33 } else if Cali] < x) { 
34 return search(a, i+1, hi, x); 
35 } else { 
36 return search(a, lo, i-1, x); 
37 } 
38 } 
39 } 
The output is: 


{22, 33, 44, 55, 66, 77, 88, 99} 

search(a, 44): 2 

search(a, 50): -1 

search(a, 77): 5 

search(a, 100): -1 

The search() method returns the index of the target x: search(a, 44) returns 2 because a[2] = 44 

and search(a, 77) returns 5 because a[5] = 77. The method returns —1 when the target is not in the 
array: search(a, 50) returns —1 because 50 is not in the array. 


BINOMIAL COEFFICIENTS 


The binomial coefficients are the coefficients that result from the expansion of a binomial 
expression of the form (x + 1)”. For example, 
(x + 1)° =x + 6x° + 15x4 + 20x3 + 15x? + 6x41 
The seven coefficients generated here are 1, 6, 15, 20, 15, 6, and 1. 
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The French mathematician Blaise Pascal (1623-1662) discovered a recursive relationship 
among the binomial coefficients. By arranging them in a triangle, he found that each interior 
number is the sum of the two directly above it. (See Figure 9.3.) For example, 15 =5 + 10. 

Let c(n,k) denote the coefficient in row number n and column number & (counting from 0). For 
example, c(6,2) = 15. Then Pascal’s recurrence relation can be expressed as 

c(n, k) = c(n-1, k-1) + c(n-1, k), forO<k<n 
For example, when 7 = 6 and k = 2, c(6,2) = c(5,1) + c(5,2). 


EXAMPLE 9.11 Recursive Implementation of the Binomial Coefficient Function 


public static int cCint n, int k) { 


1 

2 if Ck==0 || k==n) { 

3 return 1; // basis 

4 } 

5 return c(n-1,k-1) + c(n-1,k); // recursion 
6 } 


The basis for the recursion covers the left and right sides of the triangle, where k = 0 and where k = n. 


Column 2 


Figure 9.3 Pascal’s triangle 


The binomial coefficients are the same as the combination numbers used in combinatorial 
mathematics and computed explicitly by the formula 


acy = n! 7 (A\()(4) od 
: ki(n—k)! 1 2 37°" k 
In this context, the combination is often written c(n, k) = @ and is pronounced “n choose k.” 
For example, “8 choose 3” is (3) = (8/1)(7/2)(6/3) = 56. 


k 


EXAMPLE 9.12 Iterative Implementation of the Binomial Coefficient Function 


This version implements the explicit formula given above. The expression on the right consists of k 
factors, so it is computed by a loop iterating & times: 

1 public static int cCint n, int k) { 

2 if (n<2 || k==01/| k==n { 

3 return 1; 

4 } 

5 int c=1; 
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6 for (int j = 1; j <= k; j++) f{ 

7 c = c*(n-j+1)/j; 

8 } 

9 return Cc; 

10 } 
THE EUCLIDEAN ALGORITHM 494 

—13 
The Euclidean Algorithm computes the greatest a 

common divisor of two positive integers. Appearing as 73 4 
Proposition 2 in Book VII of Euclid’s Elements (c. 300 3 0 130 
B.C.), it is probably the oldest recursive algorithm. As 10 ae 10 4 104 
originally formulated by Euclid, it says to subtract 26 26 
repeatedly the smaller number 7 from the larger number 78 
m until the resulting difference d is smaller than n. Then _26 
repeat the same steps with d in place of n and with 7 in 52 
place of m. Continue until the two numbers are equal. 26 
Then that number will be the greatest common divisor 2%6 


of the original two numbers. 

Figure 9.4 applies this algorithm to find the greatest 
common divisor of 494 and 130 to be 26. This is correct 
because 494 = 26-19 and 130 = 26-5. 


Figure 9.4 The Euclidean algorithm 


EXAMPLE 9.13 Recursive Implementation of the Euclidean Algorithm 


Each step in the algorithm simply subtracts the smaller number from the larger. This is done recursively 
by calling either gcd(m,n-m) or gcd(m-n,n): 
public static int gcdCint m, int n) { 


1 

2 if (m==n) { 

3 return n; // basis 

4 } else if (m<n) { 

5 return gcd(m,n-m) ; // recursion 
6 } else { 

7 return gcd(m-n,n); // recursion 
8 } 

9 } 


For example, the call gcd(494,130) makes the recursive call gcd(364,130), which makes the 
recursive call gcd(234, 130), which makes the recursive call gcd(104, 130), which makes the recur- 
sive call gcd(104, 26), which makes the recursive call gcd(78, 26), which makes the recursive call 
gcd(52,26), which makes the recursive call gcd(26, 26), which returns 26. The value 26 is then 
successively returned all the way back up the chain to the original call gcd (494 , 130), which returns it 
to its caller. 


INDUCTIVE PROOF OF CORRECTNESS 


Recursive functions are usually proved correct by the principle of mathematical induction. 
This principle states that an infinite sequence of propositions can be proved to be true by verify- 
ing that (7) the first statement is true, and (ii) the truth of every other statement in the sequence 
can be derived from the assumption that its preceding statements are true. Part (7) is called the 
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basis step and part (ii) is called the inductive step. The assumption that the preceding statements 
are true is called the inductive hypothesis. 
The recursive factorial function is correct. To prove this fact, we first verify the basis. The 
call £(0) returns the correct value 1 because of the first part: 
if (n < 2) { 
return 1; 
: 


Next, we assume that the function returns the correct value for all integers less than some n > 0. 
Then the second part 

return n*f(n-1); 
will return the correct value n! because (by the inductive hypothesis) the call f(n-1) will return 
(n—1)! and n! =n-(n—1). 

Note that we are using the “strong” principle of mathematical induction here (also called the 
second principle of mathematical induction). In this version, the inductive hypothesis allows us 
to assume that a// the preceding statements are true. In the “weak” (or “first’’) principle, we are 
allowed to assume that only the single preceding statement is true. But since these two principles 
are equivalent (i.e., they are both valid methods of proof), it is usually better to apply strong 
induction. 

The Euclidean algorithm is correct. We can use (strong) induction to prove this fact. (See 
page 322.) If m and n are equal, then that number is their greatest common divisor. So the 
function returns the correct value in that case because of the part 

if (m == n) { 
return n; 


If m and n are not equal, then the function returns either gcd(m,n-m) or gcd(m-n,n). To see 
that this too is the correct value, we need only realize that all three pairs (m,n), (m,n-m), and 
(m-n,n) will always have the same greatest common divisor. This fact is a theorem from 
number theory. 


COMPLEXITY ANALYSIS 


The complexity analysis of a recursive algorithm depends upon the solubility of its recurrence 
relation. The general technique is to let 7(n) be the number of steps required to carry out the 
algorithm on a problem of size n. The recursive part of the algorithm translates into a recurrence 
relation on 7(7). Its solution is then the complexity function for the algorithm. 

The recursive factorial function runs in O(n) time. Let 7(”) be the number of recursive 
calls made from the initial call f(n) to the function in Example 9.2 on page 165. Then 
T(0) = T(1) = 0, because if m < 2, no recursive calls are made. If 7 > 1, then the line 

return n*f(n-1); 

executes, making the recursive call f (n-1). Then the total number of recursive calls is 1 plus the 
number of calls that are made from f(n-1). That translates into the recurrence relation 

T(n) = 1+ T(n-1) 

The solution to this recurrence is 

T(n) =n-1, forn>0 
This conclusion is obtained in two stages: First we find the solution; then we use induction to 
prove that it is correct. The simplest technique for finding the solution to a recurrence relation is 
to make a table of values and look for a pattern. This recurrence relation says that each value of 
T(n) is 1 more than the previous value. So the solution f(7) =-—1 is pretty obvious. 
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Now to prove that 7(n) = n—1 for all n > 0, let {7) = n—1 and apply the (weak) principle of 
mathematical induction. The basis case is where n = 1. In that case, 7(n) = T(1) = 0 and f(n) = f(1) 
= (1) — 1 =0. For the inductive step, we assume that 7(n) = f(n) for some n > 0 and then deduce 
from that assumption that 7(n+1) =f(n+1): 

Tin+l)=14+7(n)=14+f(n)=14+(n-l)=n 
fintl)=(nt+l)-l=n 
That completes the proof. 

Now that we have determined that the complexity function for this recursive implementation 
of the factorial function 7(n) = n—1, we can conclude that this implementation “will run in O(7) 
time.” This means that its execution time will be proportional to the size of its argument zn. If it 
takes 3 milliseconds to compute 8!, then it should take about 6 milliseconds to compute 16!. 


DYNAMIC PROGRAMMING 


In most cases, recursion is very inefficient because of its frequent function calls. So an itera- 
tive implementation may be better if it is not too complex. Another alternative is to implement 
the recurrence relation by storing previously computed values in an array instead of recomputing 
them with recursive function calls. This method is called dynamic programming. 


EXAMPLE 9.14 Dynamic Programming Implementation of the Fibonacci Function 


public static int fibCint n) { 


1 

2 if (n < 2) { 

3 return n; 

4 

5 intl] f = new int[n]; 

6 f[0] = 0; 

7 f[1] = 1; 

8 for Cint i=2; i<n; i++) f{ // store the Fibonacci numbers 
9 fli] = f[i-1] + f[i-2]; 


1 return f[n-1] + f[n-2]; 
12 } 


This implementation uses a dynamic array f[] of n integers to store the first n Fibonacci numbers. 
THE TOWERS OF HANOI 


We have seen important examples of functions that are more naturally defined and more easily 
understood by using recursion. For some problems, recursion is the only reasonable method of 
solution. 

The Towers of Hanoi puzzle is a classic A 
example of a problem whose solution demands ss | | 
recursion. The game consists of a board with ri Vi 
three vertical pegs labeled A, B, and C, and a 
sequence of n disks with holes in their centers. Figure 9.5 The Towers of Hanoi puzzle 
(See Figure 9.5.) The radii of the disks are in an 
arithmetic progression (e.g., 5cm, 6cm, 7cm, 8cm, .. .) and are mounted on peg A. The rule is 
that no disk may be above a smaller disk on the same peg. The objective of the game is to move 
all the disks from peg A to peg C, one disk at a time, without violating the rule. 


B Cc 
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The general solution to the Towers of Hanoi game is naturally recursive: 
¢ Part I: Move the smaller n—1 disks from peg A to peg B. 
* Part II: Move the remaining disk from peg A to peg C. 
* Part III: Move the smaller n—1 disks from peg B to peg C. 
The first and third steps are recursive: Apply the complete solution to n—1 disks. The basis to this 
recursive solution is the case where 7 = 0. In that case, do nothing. 
The solution for the case of n = 1 disk is: 
1. Move the disk from peg A to peg C. 
The solution for the case of 7 = 2 disks is: 
1. Move the top disk from peg A to peg B. 
2. Move the second disk from peg A to peg C. 
3. Move the top disk from peg B to peg C. 
The solution for the case of 7 = 3 disks is: 
1. Move the top disk from peg A to peg C. 
Move the second disk from peg A to peg B. 
Move the top disk from peg C to peg B. 
Move the remaining disk from peg A to peg C. 
Move the top disk from peg B to peg A. 
Move the second disk from peg B to peg C. 
7. Move the top disk from peg A to peg C. 
Here, steps 1—3 constitute Part I of the general solution, step 4 constitutes Part II, and steps 5—7 
constitute Part III. 
Since the general recursive solution requires the substitution of different peg labels, it is better 
to use variables. Then, naming this three-step algorithm hanoi(n, x, y, z), it becomes: 
* Part I: Move the smaller n—1 disks from peg x to peg z. 
¢ Part II: Move the remaining disk from peg x to peg y. 
¢ Part II: Move the smaller n—1 disks from peg z to peg y. 
The general solution is implemented in Example 9.15. 


GN NS 


EXAMPLE 9.15 The Towers of Hanoi 


This program prints the solution to the Towers of Hanoi problem of moving three disks from peg A to 


peg C via peg B: 
1 public class TestHanoiTowers { 
public static void main(String[] args) { 


2 

3 HanoiTowers(3, 'A', 'B', 'C'); 

4 } 

5 

6 public static void HanoiTowers(int n, char x, char y, char z) { 
7 if (n==1) { // basis 

8 System.out.printfC"Move top disk from peg %c to peg %c.%n", Xx, Z); 
9 } else { 

10 HanoiTowers(n-1, x, z, y); // recursion 

1 HanoiTowers(1, x, y, Z)3 // recursion 

12 HanoiTowers(n-1, y, x, z); // recursion 

13 } 

14 } 
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The output is: 
Move top disk from peg A to peg C. 
Move top disk from peg A to peg B. 
Move top disk from peg C to peg B. 
Move top disk from peg A to peg C. 
Move top disk from peg B to peg A. 
Move top disk from peg B to peg C. 
Move top disk from peg A to peg C. 


To solve the problem for three disks, the call at line 3 passes 3 ton, 'A' to x, 'B' to y, and 'C' to z. 
Since n> 1, line 10 executes next, passing 2 ton, 'A' tox, 'B' to z, and 'C' to y. Again, since n> 1, line 
10 executes next, passing | ton, 'A' to x, 'B' to y, and 'C' to z. In that call, n = 1, so line 8 executes, 
printing the first line of output: 

Move top disk from peg A to peg C. 

That call returns to where the previous call left off at line 10, proceeding to line 11, where n = 2, 

x= 'A',y='C', and z= 'B'. That prints the second line of output: 

Move top disk from peg A to peg B. 
Then line 12 executes, this time passing | ton, 'C' to x, 'A' to y, and 'B' to z. In that call, n= 1 again, so 
line 8 executes, printing the third line of output: 

Move top disk from peg C to peg B. 

That call returns to where the second recursive call had left off at line 12. Since that is the last execut- 
able statement in the method, it also returns, back to where the first recursive call had left off at line 10. So 
it proceeds to line 11 with n=3, x= 'A', y= 'B', and z= 'C'. That prints the fourth line of output: 

Move top disk from peg A to peg C. 
Then line 12 executes, passing 2 ton, 'B' tox, 'A' toy, and 'C' toz. 

That call, HanoiTowers(2, 'B', 'A', 'C'), recursively moves the stack of two disks from peg B to 
to peg C via peg A and generates the last three lines of output: 

Move top disk from peg B to peg A. 

Move top disk from peg B to peg C. 

Move top disk from peg A to peg C. 
Since the previous four moves had already transferred the largest disk from peg A to peg C, this completes 
the task. 


MUTUAL RECURSION —) 
When a function calls itself, it is called direct Direct recursion aa 


recursion. Another form of recursion is when a 

function calls other functions that call other 

functions that eventually call the original function. ae. = 
This is called indirect recursion. Its most common ee 
form is when two functions call each other. This is 

called mutual recursion. (See Figure 9.6.) Figure 9.6 Types of recursion 


Indirect recursion 


Mutual recursion 


EXAMPLE 9.16 The Sine and Cosine Functions Computed by Mutual Recursion 


The sine and cosine functions from trigonometry can be defined in several different ways, and there are 
several different algorithms for computing their values. The simplest (although not the most efficient) is 
via mutual recursion. It is based upon the identities: 

sin2@=2sin6 cos@ 
cos20 = 1—2(sind)” 
and the two Taylor polynomials: 


sinx = x —x°/6 


17 


6 


RECURSION 


cosx x 1 —x2/2 


which are close approximations for small values of x. 
public class TestMutualRecursion { 

public static void main(String[] args) { 

String fmt1 = "%18s%18s%18s%n" ; 


1 


oa nnn Fr wo ND 


String fmt2 = 
System.out.printf(fmt1, "s(x) 
for (double x = 


} 


System.out.printfCfmt1, "c(x) 
for (double x = 


i 
} 


public static double s(double x) { 
if (-0.005 < x & x < 0.005) { 


return xX - x*x*x/6; 


} 
return 2*S(x/2)*c(x/2); 


} 


// basis 


public static double c(double x) { 
if (-0.005 < x & x < 0.005) { 


return 1.0 - x*x/2; 


} 
} 


return 1 - 2*s(x/2)*s(x/2); 


} 
The output is: 


ooooocooococo°coo 


oOoOOCOCCOCOCOCOCOF 


s(x) 


- 0000000000000 
- 0998334166464 
- 1986693307941 
- 2955202066544 
- 3894183423069 
- 4794255385991 
- 5646424733831 
- 6442176872362 
- 7173560908969 
- 7833269096232 
- 8414709848016 


c(x) 


- 0000000000000 
- 9950041652781 
- 9800665778414 
- 9553364891277 
- 9210609940036 
- 8775825618932 
- 8253356149179 
. 7648421872857 
- 6967067093499 
- 6216099682760 
- 5403023058779 


eo ooo ooR eRe eoR@) 


OOOCOCCOCCOCCOCCOCOOF 


// basis 


Math. sin(x) 


. 0000000000000 
- 0998334166468 
. 1986693307951 
- 2955202066613 
- 3894183423087 
- 4794255386042 
- 5646424733950 
. 6442176872377 
. 7173560908995 
- 7833269096275 
. 8414709848079 


Math. cos(x) 


. 0000000000000 
- 9950041652780 
- 9800665778412 
- 9553364891256 
- 9210609940029 
- 8775825618904 
- 8253356149097 
. 7648421872845 
- 6967067093472 
- 6216099682707 
- 5403023058681 


"%18 .13F%18 .13F%18 .13F%n" ; 


» "Math.sin(x) " 


1) f{ 
sin(x), Math.sin(x) - s(x)); 


0.0; x < 1.0; x += 0. 
System.out.printfCfmt2, s(x), Math. 


0.0; x < 1.0; x += 0. 
System.out.printfCfmt2, cCx), Math. 


// recursion 


"Math.cos(x) " 
1) { 
cos(x), c(x) - Math.cos(x)); 


ooooqoooocoocoo°co 


oo0o0coocooocoocoococo 


// recursion 


error 


. 0000000000000 
. 0000000000005 
. 0000000000009 
. 0000000000069 
. 0000000000018 
. 0000000000051 
. 0000000000120 
. 0000000000015 
. 0000000000027 
. 0000000000043 
. 0000000000063 


error 


. 0000000000000 
. 0000000000000 
. 0000000000002 
. 0000000000021 
. 0000000000007 
. 0000000000028 
. 0000000000082 
. 0000000000013 
. 0000000000027 
. 0000000000054 
. 0000000000098 


"error 


"error 
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This works because on each recursive call x is divided by 2, and eventually it reaches the basis criterion 
(-0.005 < x && x < 0.005), which stops the recursion. 


CHAP. 9] RECURSION 177 


9.1 


9.2 


9.3 


9.4 


9.5 


9.1 


9.2 
9.3 
9.4 


9.5 


9.6 
9.7 


9.8 


9.9 


9.10 


9.11 


9.12 


9.13 


Review Questions 


A recursive function must have two parts: its basis and its recursive part. Explain what each 
of these is and why it is essential to recursion. 


How many recursive calls will the call £(10) to the recursive factorial function (Example 9.2 
on page 165) generate? 


How many recursive calls will the call fib(6) to the recursive Fibonacci function (Example 
9.6 on page 167) generate? 


What are the advantages and disadvantages of implementing a recursive solution instead of 
an iterative solution? 


What is the difference between direct recursion and indirect recursion? 
Problems 


Write and test a recursive function that returns the sum of the squares of the first 7 positive 
integers. 


Write and test a recursive function that returns the sum of the first 7 powers of a base b. 
Write and test a recursive function that returns the sum of the first 7 elements of an array. 


Write and test a recursive function that returns the maximum among the first 7 elements of an 
array. 


Write and test a recursive function that returns the maximum among the first 7 elements of an 
array, using at most lg 7 recursive calls. 


Write and test a recursive function that returns the power x”. 


Write and test a recursive function that returns the power x”, using at most 2 lg recursive 
calls. 


Write and test a recursive function that returns the integer binary logarithm of an integer 1 
(i.e., the number of times 7 can be divided by 2). 


Write and test a recursive boolean function that determines whether a string is a palindrome. 
(A palindrome is a string of characters that is the same as the string obtained from it by 
reversing its letters.) 


Write and test a recursive function that returns a string that contains the binary representation 
of a positive integer. 


Write and test a recursive function that returns a string that contains the hexadecimal repre- 
sentation of a positive integer. 


Write and test a recursive function that prints all the permutations of the first 7 characters of a 
string. For example, the call printC"ABC", 3) would print 

ABC 

ACB 

BAC 

BCA 

CBA 

CAB 


Implement the Fibonacci function iteratively (without using an array). 
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9.14 


9.15 
9.16 


9.17 
9.18 


9.19 


9.20 


9.21 
9.22 
9.23 
9.24 
9.25 
9.26 
9.27 


9.28 


9.29 


9.30 
9.31 
9.32 


9.33 


9.34 
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Implement the recursive Ackermann function: 
A(O, n)=n+ 1 

A(m, 0) = A(m — 1, 1), if m > 0 

A(m, n) = A(m— 1, A(m, n—1)), ifm > 0 andn > 0 


Prove Pascal’s recurrence relation (page 170). 


Trace the recursive implementation of the Euclidean Algorithm (Example 9.13 on page 171) 
on the call gcd(385, 231). 


Implement the Euclidean Algorithm (page 171) iteratively. 


Implement the recursive Euclidean Algorithm using the integer remainder operator % instead 
of repeated subtraction. 


Implement the Euclidean Algorithm iteratively using the integer remainder operator % 
instead of repeated subtraction. 


Use mathematical induction to prove that the recursive implementation of the Fibonacci 
function (Example 9.6 on page 167) is correct. 


se mathematical induction to prove that the recursive function in Problem 9.4 is correct. 


se mathematical induction to prove that the recursive function in Problem 9.8 is correct. 


U 
Use mathematical induction to prove that the recursive function in Problem 9.5 is correct. 
U 
U 


se mathematical induction to prove that the recursive function in Problem 9.12 is correct. 


The computable domain of a function is the set of inputs for which the function can produce 
correct results. Determine empirically the computable domain of the factorial function imple- 
mented in Example 9.2 on page 165. 


Determine empirically the computable domain of the sum(b,n) function implemented in 
Problem 9.2 on page 177, using b = 2. 


Determine empirically the computable domain of the Fibonacci function implemented in 
Example 9.3 on page 166. 


Determine empirically the computable domain of the recursive binomial coefficient function 
(Example 9.11 on page 170). 


The Towers of Hanoi program performs 7 disk moves for 3 disks. How many disk moves are 
performed for: 


a. 5 disks? 
b. 6 disks? 
ce. n disks? 


Prove the formula that you derived in previous problem. 

Determine empirically the computable domain of the Ackermann function (Problem 9.14). 
Show the recursive call tree for the call hanoi(4,'A','B','C') in Example 9.15 on page 
174. 


Modify the program in Example 9.16 on page 175 so that the results are more accurate by 
narrowing the bases so that recursion continues until |x| < 0.00005. 


Modify the program in Example 9.16 on page 175 so that the results are obtained in fewer 
iterations by using the more accurate Taylor approximations 

sinx  x—x3/6 + x°/120 

cosx © 1 —x7/2 +x4/24 
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9.35 


9.36 


9.37 


9.1 


9.2 
9.3 


9.4 


9.5 


9.1 


9.2 


Use these formulas to implement the hyperbolic sine and hyperbolic cosine functions recur- 
sively: 

sinh2x = 2sinhxcoshx 

cosh2x = 1 + 2(sinhx)* 

sinx =x + x3/6 

cosx = 1+x7/2 

Compare your results with the corresponding values of the Math. sinh() and Math. cosh() 
methods. 


Use these trigonometric formulas to implement the tangent function recursively: 
tan20 = 2tand /(1 — tan*0) 

tanx »x+x/3 

Compare your results with the corresponding values of the Math. tan() method. 


Implement a recursive function that evaluates a polynomial a) + a,x + ax? +--+ + a,x, 


where the 7+1 coefficients a; are passed to the function in an array along with the degree n. 


Answers to Review Questions 


The basis of a recursive function is its starting point in its definition and its final step when it is being 
called recursively; it is what stops the recursion. The recursive part of a recursive function is the 
assignment that includes the function on the right side of the assignment operator, causing the function 
to call itself; it is what produces the repetition. For example, in the factorial function, the basis is n! = 
1 if m = 0, and the recursive part is nm! =n(n—-1) ifn>0. 


The call factorial (10) will generate 10 recursive calls. 


The call £(6) to the Fibonacci function will generate 14 + 8 = 22 recursive calls because it calls f (5) 
and f(4), which generate 14 and 8 recursive calls, respectively. 


A recursive solution is often easier to understand than its equivalent iterative solution. But recursion 
usually runs more slowly than iteration. 


Direct recursion is where a function calls itself. Indirect recursion is where a group of functions call 
each other. 


Solutions to Problems 


A recursive function that returns the sum of the first 1 squares: 
int sumCint n) { 


if (n == 0) { 
return 0; // basis 
} 
return sum(n-1) + n*n; // recursion 


} 


A recursive function that returns the sum of the first 7 powers of a base b: 
double sum(double b, int n) { 


if (n == 0) { 
return 1; // basis 
} 
return 1 + b*sum(b,n-1); // recursion 


} 
Note that this solution implements Horner’s method: 1 + b*(1 + b*(1 + b*(1 +--+ +))). 
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9.3 A recursive function that returns the sum of the first 1 elements of an array: 
double sum(double[] a, int n) { 


if (n == 0) { 
return 0.0; // basis 
} 
return sum(a,n-1) + a[n-1]; // recursion 


i 


9.4 A recursive function that returns the maximum among the first 7 elements of an array: 
double max(double[] a, int n) { 


if (n == 1) { 
return a[0]; // basis 
} 
double m = max(a,n-1); // recursion 


if (a[n-1] > m) { 
return a[n-1]; 
} else { 
return m; 
} 
} 


9.5 A recursive function that returns the maximum among the first n elements of an array and makes no 


more than lgv recursive calls: 
double max(double[] a, int lo, int hi) { 

if Clo >= hi) { 

return a[lo]; 


} 
int mid = Clo + hi)/2; // middle index 
double m1 = max(a, lo, mid); // recursion on a[lo..mid] 
double m2 = max(a, mid + 1, hi); // recursion on a[mid+1..hi] 
return (ml>m2? m1: m2); // maximum of {m1,m2} 

} 


9.6 A recursive function that returns the power x”: 
double pow(double x, int n) { 


if (n == 0) f{ 
return 1.0; // basis 
} 
return x*pow(x,n-1); // recursion 
} 


9.7. A recursive function that returns the power x” and makes no more than lgv recursive calls: 
double pow(double x, int n) { 
if (n == 0) { 
return 1.0; // basis 
} 
double p = pow(x,n/2); 
if (n%2 == 0) { 


return p*p; // recursion (Cn even) 
} else { 
return x*p*p; // recursion (Cn odd) 


} 
} 


9.8 A recursive function that returns the integer binary logarithm of n: 
int IgCint n) { 


if (n == 1) { 
return 0; // basis 
} 
return 1 + lg(n/2); // recursion 


} 


CHAP. 9] RECURSION 181 


9.9 A recursive function that determines whether a string is a palindrome: 
boolean isPalindrome(String s) { 
int len = s.lengthQ); 
if Clen < 2) { 


return true; // basis 
} else if (s.charAt(O) != s.charAt(len-1)) 
return false; // basis 


} else if (len == 2) { 
return true; 
} else { // basis 
return isPalindrome(s.substring(1,len-1)); // recursion 
} 
} 


9.10 A recursive function that converts decimal to binary: 
String binaryCint n) { 
String s; 
if (n%2 == 0) { 
s= "0"; 
} else { 
SS Ls 
} 
if (n < 2) { 
return s; // basis 
} 
return binary(n/2) + s; // recursion 


a 


9.11 A recursive function that converts decimal to hexadecimal: 
String hexadecimal(int n) { 
if (n < 16) { 
return Integer. toString(n%16) 
} 
return hexadecimal(n/16) + s; // recursion 


i; 


9.12 A recursive function that prints permutations: 
void print(String str) { 
printC"",str); 
} 


void print(String left, String right) { 
int n = right. lengthQ); 

if (n == 0) { 
return; 

} else if (n == 1) 
System.out.printInCleft+right) ; 
return; 

} 

StringBuilder buf = new StringBuilder(right) ; 

for Cint i = 0; i < n; i++) f{ 
char temp = s.charAt(i); 
s.setCharAt(i, s.charAt(0)); 
s.setCharAt(O, temp); 
printCleft+temp, s.substring(1, n)); 

} 

} 
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9.13 Iterative implementation of the Fibonacci function: 
int fibCint n) { 
if (n < 2) { 
return n; 


} 
int f0 = 0, fl =1, f = f0+f1; 
for Cint i = 2; i <n; i++) { 


3 


9.14 The Ackermann function: 
int ackermannCint m, int n) { 


if (m == 0) f{ 

return n + 1; // basis 
} else if (n == 0) { 

return ackermann(m - 1, 1); // basis 
} else { 


return ackermann(m - 1, ackermann(m, n - 1)); // recursion 


} 
9.15 Consider the relationship c(8,3) = 56 = 35 + 21 =c(7,3) + c(7,2) from the expansion of (x + 1)*: 
(x+ 18+ D+)’ 
(x + I(x? + 7x6 + 21x95 + 35x4 + 35x93 + 21x? + Tx +1) 
Te PQ aS 5g ti Ta oe 
t x? + 7x6 +21x9 + 35x4 + 35x39 + 21x27 + Tx +1 
x8 + 8x7 + 28x° + 56x° + 70x4 + 56x3 + 28x? + 7x + 1 
The coefficient c(8,3) is for the x° term, which is 35x° + 21x° = 56x>. The sum 35x° + 21x° came from 
(x)(35x*) and (1)(21x°). So those coefficients are 35 = c(7,3) and 21 =c(7,2). 
The general proof is based upon the same argument: c(n,k) is the coefficient of the term x* in the 


expansion of (x + 1)”. Since (x + 1)” =(x + 1)(x + 1)”~!, that term comes from the sum 
(x)(e(n=1, k= 1) x*-!) + (A)(e(n=1, Wx) = (e(n=1, k= 1) + e(n= 1, bk) x* 
Therefore c(n, k) =c(n—1, k-1) + c(n—-1, Bh). 
9.16 Figure 9.7 shows the trace of the call gcd(616, 231): 


main() gcd(385,231) gcd(154, 231) gcd(154,77) gcd(77,77) 
m|385 m|{385 m{154 m|154 m| 77 
n{oaT 385,231 n 1231 154,231 nl231 154,77 nlv7l 77,77 nl77] 
77 77 we 77 77 


Figure 9.7 Tracing the Euclidean algorithm 


9.17 Iterative implementation of the Euclidean algorithm: 
int gcd(int m, int n) { 


while (m != n) { // INVARIANT: gcd(m,n) 
if (m<n) { 
n-= Mm; 
} else { 
m -= Nn; 
} 
} 
return n; 


} 
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9.18 


9.19 


9.20 


9.21 


9.22 


9.23 


9.24 


Recursive implementation of the Euclidean algorithm using the remainder operator: 
int gcdCint m, int n) { 


if (m == 0) { 

return n; // basis 
} else if (n == 0) { 

return m; // basis 
} else if (m < n) { 

return gcd(m, n%m); // recursion 
} else { 

return gcd(m%n, n); // recursion 
} 


i 


Iterative implementation of the Euclidean algorithm using the remainder operator: 
int gcdCint m, int n) { 
while (n > 0) { // INVARIANT: gcd(m,n) 
int r = m%n; 


m=n; 
n=r3 

} 

return m; 


} 


To prove that the recursive implementation of the Fibonacci function is correct, first verify the basis. 
The calls fib(0) and fib(1) return the correct values 0 and 1 because of the first line 

if (n< 2) { 

return n; 

} 
Next, we assume that the function returns the correct value for all integers less than some n > 1. Then 
the second line 

return fib(n-1) + fib(n-2); 
will return the correct value because (by the inductive hypothesis) the calls fib(n-1) and fib(n-2) 
return the correct values for F’,_, and F,,_,, respectively, and F,, = F’,_, + F,,_, by definition. Note that 
the basis here requires the verification of the first two steps in the sequence because the recurrence 
relation F,, = F,,_, + F,,_, applies only for > 1. 


If n = 1, then the basis executes, returning a[0] which is the maximum element because it is the only 
element. If 7 > 1, then the function correctly computes the maximum m of the first v-1 elements (by 
the inductive hypothesis). If the condition (a[n-1] > m) is true, then that element a[n-1] is returned, 
and it is the largest because it is larger than m, which is larger than all the others. On the other hand, if 
the condition (a[n-1] > m) is false, then m is returned, and that is the largest because it is not smaller 
than a[m-1], and it is the largest among all the others. 


If n = 1, then the basis executes, returning a[0] which is the maximum element because it is the only 
element. If 7 > 1, then the function correctly computes the maxima m1 and m2 of the first and second 
halves of the array (by the inductive hypothesis). One of these two numbers is the correct maximum 
for the entire array. The larger is returned. 


If m = 1, then the basis executes, returning 0, which is the number of times 7 can be divided by 2. If 
n> 1, then the function correctly computes the number of times 7/2 can be divided by 2 (by the induc- 
tive hypothesis). This is | less than the number of times n can be divided by 2, so the value returned, 
1 + 1g(n/2), is correct. 


First, we prove the conjecture that the call print(left, right) will print 7! distinct strings, all hav- 
ing the same prefix string left, where n = right. lengthQ). If = 1, the method prints left+right 
and returns; that is 1! (distinct) string. Assume that when right.lengthQ) = n-1, the call 
print(left, right) prints (7-1)! distinct strings all having the same left prefix string. Then, when 
right. length) =n, the for loop makes n calls of the form print (left+temp,ss), where temp is 
a distinct character and ss = s.substring(1,n). Since the length of s.substring(1,n) is n—1, 
each of those calls will print (7-1)! distinct strings all having the same left+temp prefix string. 
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9.25 


9.26 


9.27 


9.28 


9.29 


9.30 


9.31 


9.32 


9.33 
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Therefore, the loop will print (”)(n—1)! distinct strings all having the same left prefix string. This 
proves the conjecture by mathematical induction. Now it follows from that conjecture that the call 
print(str) will print m! distinct permutations of the characters in the string str, where n is its 
length. Since that is precisely the total number of permutations that the string has, it follows that the 
method is correct. 


For the factorial function implemented in Example 9.2 on page 165, integer overflow occurs on the 
return type long with 7 = 13 on the author’s computer. So the computable domain for this function is 
O<n¥l2. 


For the sum(b,n) function implemented in Problem 9.2 on page 177 with b = 2, floating point over- 
flow occurs on the return type double with 7 = 1,023 on the author’s computer. So the computable 
domain for this function is 0 <n < 1,022. 


For the Fibonacci function implemented in Example 9.6 on page 167, the overhead from the recursive 
calls degrades the run-time performance noticeably after n = 36 on the author’s computer. So the com- 
putable domain for this function is about 0 <n < 40. 


For the binomial coefficient function implemented in Example 9.7 on page 167, the overhead from the 
recursive calls degrades the run-time performance noticeably after n = 25 on the author’s computer. So 
the computable domain for this function is about 0 <n < 30. 


The Towers of Hanoi program performs: 
a. 31 moves for 5 disks 
b. 63 moves for 6 disks 
ec. 2” —1 moves for n disks 


To prove that the Towers of Hanoi program performs 2” — 1 disk moves for v disks, use mathematical 
induction. The basis is established in Example 9.15 on page 174. To move + 1 disks, it takes 2” — 1 
moves to move all but the last disk to peg B (by the inductive hypothesis). Then it takes 1 move to 
move the last disk to peg C, and 2” — 1 more moves to move the rest of the disks from peg B to peg C 
on top of that last disk. The total is (2”— 1) + 1+(2”-1)=2"!—1, 


For the Ackermann function implemented in Problem 9.14 on page 178, exceptions are thrown for 
m = 17 when n = 2, for m = 5 when n = 3, for m = 4 when n = 4, and for m = 3 when n = 5. So the 
computable domain for this function is restricted to 0 <m < 16 whenn=2, to0 <m<4 when n=3, to 


0<m<3 whenn =4, and to0 <m<2 whenn=5S. 
h(1,A,C,B) 
AER E p< +A AB 
h(1,B,A,C) 


The call tree for Example 9.15 on page 174 is: 
h¢3,A,C,B) h(1,A,C,B) 


[hC1,C,B,A) | 
h(2,C, A,B) ee hC,C,A,B) 
HCA, C,B) 
h(2,B,C,A) << hC,B,C,A) 
hCL,C,B,A) 


hG.BA,O hd,B,A,O 
RCA, C,B) 
h(2,A,B,0) ae hd,A,B,Q) 
hC,B,A,O 


These are more accurate recursive implementation of the sine and cosine functions: 
public static double s(double x) { 
if (Math.abs(x) < 0.00005) { 
return x - x*x*x/6; // basis 


} 


h(4,A,B,0) h(1,A,B,Q) 


Figure 9.8 Call tree 
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return 2*s(x/2)*c(x/2); // recursion 


} 


public static double c(double x) { 
if (Math.abs(x) < 0.00005) { 
return 1.0 - x*x/2; // basis 
} 
return 1 - 2*s(x/2)*s(x/2); + // recursion 
} 


9.34 These are faster converging implementation of the sine and cosine functions: 
public static double s(double x) { 
if (-0.005 < x & x < 0.005) { 
return xX - X*x*x/6 + X*x*x*x*x/120;3 // basis 
} 
return 2*s(x/2)*c(x/2); // recursion 


- 


public static double c(double x) { 
if (-0.005 < x & x < 0.005) { 
return 1.0 - x*x/2 + x*x*x*x*x/24; // basis 
} 
return 1 - 2*s(x/2)*s(x/2); // recursion 
} 


9.35 These are mutually recursive implementations of the hyperbolic sine and cosine functions: 
public static double s(double x) { 
if (-0.005 < x & x < 0.005) { 
return xX + X*x*x/6;3 // basis 
} 
return 2*s(x/2)*c(x/2);  // recursion 


} 


public static double c(double x) { 
if (-0.005 < x & x < 0.005) { 
return 1.0 + x*x/2; // basis 
} 
return 1 + 2*s(x/2)*s(x/2); // recursion 


i 


9.36 This is a recursive implementation of the tangent function: 
public static double t(double x) { 
if (Math.abs(x) < 0.5e-10) { 


ey ky te 


return xX + x*x/3 + x*x*x*x/5; // basis 


} 
double tx2 = t(x/2); 
return 2*tx2/(1 - tx2*tx2); // recursion 


i; 


9.37 This is a recursive evaluation of a polynomial function: 
public static double p(double[] a, double x) { 
// returns a[O] + a[1]*x + a[2]*x*x + ... 
return p(a, x, 0); 


} 


private static double p(double[] a, double x, int k) { 
// returns alk] + a[k+1]*x + a[k+2]*x*x + ... 
if (k == a.length) { 
return 0.0; // basis 
} 
return a[k] + x*p(Ca, x, k+1); // recursion 


} 


CHAPTER 10 


Trees 


A tree is a nonlinear data structure that models a hierarchical organization. The characteristic 
features are that each element may have several successors (called its “children”) and every 
element except one (called the “root’) has a unique predecessor (called its “parent”). Trees are 
common in computer science: Computer file systems are trees, the inheritance structure for Java 
classes is a tree, the run-time system of method invocations during the execution of a Java 
program is a tree, the classification of Java types is a tree, and the actual syntactical definition of 
the Java programming language itself forms a tree. 


TREE DEFINITIONS 


Here is the recursive definition of an (unordered) tree: 


A tree is a pair (r, S), where r is a node and S is a set of disjoint trees, none of 
which contains 7. 


The node sz is called the root of the tree T, and the elements of the set S are called its subtrees. The 
set S, of course, may be empty. The restriction that none of the subtrees contains the root applies 
recursively: r cannot be in any subtree or in any subtree of any subtree. 

Note that this definition specifies that the second component of a tree be a set of subtrees. So 
the order of the subtrees is irrelevant. Also note that a set may be empty, so (7, ©) qualifies as a 
tree. This is called a singleton tree. But the empty set itself does not qualify as an unordered tree. 


EXAMPLE 10.1 Equal Unordered Trees BBs Steen, 

The two trees shown in Figure 10.1 are equal. The tree “3B a oe ae 
on the left has root a and two subtrees B and C, where Soe Be eel 
B=(b, ©), C=(c, {D}), and D is the subtree D = (d, ©). re ot aie iene a a 
The tree on the right has the same root a and the same set eet 
of subtrees {B, C} = {C, B}, so (a, {B, C}) =(a, {C, B}). Figure 10.1 Equal trees 


The elements of a tree are called its nodes. Technically, each node is an element of only one 
subtree, namely the tree of which it is the root. But indirectly, trees consist of nested subtrees, 
and each node is considered to be an element of every tree in which it is nested. So a, b, c, and d 
are all considered to be nodes of the tree A shown Figure 10.2. Similarly, ¢ and d are both nodes 
of the tree C. 
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The size of a tree is the number of nodes it contains. So the : es 
tree A shown in Figure 10.2 has size 4, and C has size 2. A tree eee | 
of size | is called a singleton. The trees B and D shown here are z 
singletons. 

If T= (x, S) is a tree, then x is the root of T and S is its set of | a. \ 
subtrees S = {7,, T,,..., T,}. Each subtree 7, is itself'a tree with e ee ne 
its own root 7;. In this case, we call the node r the parent of / \ zr \ 
each node ,, and we call the 7; the children of r. In general, we : ; ma : 


say that two nodes are adjacent if one is the parent of the other. 


A node with no children is called a /eaf. A node with at least Figure 10.2 Subtrees 
one child is called an internal node. 
A path in a tree is a sequence of nodes (Xp, x1, X2, . . -» X») Wherein the nodes of each pair with 


adjacent subscripts (x;,, x;) are adjacent nodes. For example, (a, b, c, d) is a path in the tree 
shown above, but (a, d, b, c) is not. The /ength of a path is the number m of its adjacent pairs. 

It follows from the definition that trees are acyclic, that is, no path can contain the same node 
more than once. 

A root path for a node x, in a tree is a path (%, x), X2, . . -, X,,) where x,, 1s the root of the tree. A 
root path for a leaf node is called a /eaf-to-root path. 


Theorem 10.1 Every node in a tree has a unique root path. 
For a proof, see Problem 10.1 on page 194. 


The depth of a node in a tree is the length of its root path. Of course, the depth of the root in 
any tree is 0. We also refer to the depth of a subtree in a tree, meaning the depth of its root. 

A level in a tree is the set of all nodes at a given depth. 

The height of a tree is the greatest depth among all of its nodes. By definition, the height of a 
singleton is 0, and the height of the empty tree is —1. For example, the tree A, shown in Figure 
10.2, has height 2. Its subtree C has height 1, and its two subtrees B and D each have height 0. 

A node y is said to be an ancestor of another node x if it is on x’s root path. Note that the root 
of a tree is an ancestor of every other node in the tree. 

A node x is said to be a descendant of another node y if y is an ancestor of x. For each node y 
in a tree, the set consisting of y and all its descendants form the subtree rooted at y. If S is a 
subtree of 7; then we say that T is a supertree of S. 

The path length of a tree is the sum of the lengths of all paths from its root. This is the same as 
the weighted sum, adding each level times the number of nodes on that level. The path length of 
the tree shown here is 1-3 + 2-4 + 3-8 = 35. 


EXAMPLE 10.2 Properties of a Tree 


The root of the tree shown in Figure 10.3 is node a. 
The six nodes a, b, c, e, f, and h are all internal 
nodes. The other nine nodes are leaves. The path (1, 
h, c, a) is a leaf-to-root path. Its length is 3. Node b 
has depth 1, and node m has depth 3. Level 2 consists 
of nodes e, f, g, and h. The height of the tree is 3. 
Nodes a, c, and h are all ancestors of node 1. Node i 
k is a descendant of node c but not of node b. The 
subtree rooted at b consists of nodes b, e, i, and j. 


Figure 10.3 A leaf-to-root path 
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The degree of a node is the number of its children. In Example 10.2, b has degree 1, d has 
degree 0, and h has degree 5. 

The order of a tree is the maximum degree 
among all of its nodes. 

A tree is said to be full if all of its internal nodes 
have the same degree and all of its leaves are at the 
same level. The tree shown in Figure 10.4 is a full Figure 10.4 A full tree 
tree of degree 3. Note that it has a total of 40 nodes. 

qd" +1 _ 


Theorem 10.2 The full tree of order d and height / has 
For a proof, see Problem 10.1 on page 194. 


nodes. 


Corollary 10.1 The height of a full tree of order d and size n is h = log, (nd—n+1)-1. 


h+i_ 


Corollary 10.2 The number of nodes in any tree of height / is at most where d is 


the maximum degree among its nodes. 


DECISION TREES 


A decision tree is a tree diagram that summarizes all the different possible stages of a process 
that solves a problem by means of a sequence of decisions. Each internal node is labeled with a 
question, each arc is labeled with an answer to its question, and each leaf node is labeled with the 
solution to the problem. 


EXAMPLE 10.3 Finding the Counterfeit Coin 


Five coins that appear identical are to be tested to determine which one of them is counterfeit. The only 
feature that distinguishes the counterfeit coin is that it weighs less than the legitimate coins. The only 
available test is to weigh one subset of the coins against another. How should the subsets be chosen to find 
the counterfeit? 


y < 1b} 


{a 


Figure 10.5 A decision tree 
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In the decision tree shown in Figure 10.5, the symbol ~ means to compare the weights of the two 
operands. So, for example, {a, b} ~ {d, e} means to weight coins a and 5 against coins d and e. 


TRANSITION DIAGRAMS 


A transition diagram is a tree or graph (see Chapter 15) whose internal nodes represent differ- 
ent states or situations that may occur during a multistage process. As in a decision tree, each leaf 
represents a different outcome from the process. Each branch is labeled with the conditional 
probability that the resulting child event will occur, given that the parent event has occurred. 


EXAMPLE 10.4 The Game of Craps 


The game of craps is a dice game played by two players, X and ¥. First _X tosses the pair of dice. If the 
sum of the dice is 7 or 11, X wins the game. If the sum is 2, 3, or 12, Y wins. Otherwise, the sum is desig- 
nated as the “point,” to be matched by another toss. So if neither player has won on the first toss, then the 
dice are tossed repeatedly until either the point comes up or a 7 comes up. If a 7 comes up first, Y wins. 
Otherwise, X wins when the point comes up. 


The transition diagram shown in Figure 10.6 models the game of craps. 


7 or 11, so_X wins 


V/3 Xx 


yy 
ne 2/5 X 
¥ (5) 3/5 
> } 
\/9 
5/1 > X 
5/36 6 6/11 gs 
y 
5/36 5/M X 
(8) 6/11 
lf Bat 
7 2/5 » X 
<> (9 yD 3/5 
— } 
Ly 
2) 3 x 
(10) 2/3 
Y 


2, 3, or 12, so Y wins 


Figure 10.6 A decision tree for the game of craps 
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When a pair of dice is tossed, there are 36 possible outcomes (6 outcomes on the first die, and 6 
outcomes on the second for each outcome on the first die). Of those 36 outcomes, | will produce a sum of 
2 (1 + 1), 2 will produce a sum of 3 (1 + 2 or 2+ 1), and 1 will produce a sum of 12 (6 + 6). So there are a 
total of 4 chances out of 36 of the event “2, 3, or 12” happening. That’s a probability of 4/36 = 1/9. 
Similarly, there are 6 ways that a sum of 7 will occur and 2 ways that a sum of 11 will occur, so the proba- 
bility of the event “7 or 11” is 8/36 = 2/9. The other probabilities on the first level of the tree are computed 
similarly. 

To see how the probabilities are computed for the second 1/12 | 
level of the tree, consider the case where the point is 4. If the 
next toss comes up 4, X wins. If it comes up 7, Y wins. 
Otherwise, that step is repeated. The transition diagram 
shown in Figure 10.7 summarizes those three possibilities. 
The probabilities 1/12, 1/6, and 3/4 are computed as shown 
in the transition diagram in Figure 10.7: Figure 10.7 The game of craps 

P(A) = 3/36 = 1/12 

P(7) = 6/36 = 1/3 

P(2,3,5,6,8,9,10,11, or 12) = 27/36 = 3/4 
So once the point 4 has been established on the first toss, X has a probability of 1/12 of winning on the 
second toss and a probability of 3/4 of getting to the third toss. So once the point 4 has been established on 
the first toss, X has a probability of (3/4)(1/12) of winning on the third toss and a probability of (3/4)(3/4) 
of getting to the fourth toss. Similarly, once the point 4 has been established on the first toss, X has a 
probability of (3/4)(1/12) + (3/4)(3/4)(1/12) of winning on the fourth toss, and so on. Summing these 
partial probabilities, we find that once the point 4 has been established on the first toss, the probability that 
X wins on any toss thereafter is 


This calculation applies the formula for geometric series. (See page 323.) 
If the probability is 1/3 that X wins once the point 4 has been established on the first toss, the probabil- 
ity that Y wins at that point must be 2/3. The other probabilities at the second level are computed similarly. 
Now we can calculate the probability that XY wins the game from the main transition diagram: 
2.4 


o2y u =. apa! ue 
= § + 7p(Pad + a(Ps) + 3g(Po) + 3e(Ps) + a(Po) + 75(Pio) 


PEDO 40-49 
_ 244 


495 
= 0.4929 


So the probability that XY wins is 49.29 percent, and the probability that Y wins is 50.71 percent. 
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A stochastic process is a process that can be analyzed by a transition diagram, that is, it can be 
decomposed into sequences of events whose conditional probabilities can be computed. The 
game of craps is actually an infinite stochastic process since there is no limit to the number of 
events that could occur. As with the analysis in Example 10.4, most infinite stochastic processes 
can be reformulated into an equivalent finite stochastic process that is amenable to (finite) 
computers. 

Note that, unlike other tree models, decision trees and transition trees are usually drawn from 
left to right to suggest the time-dependent movement from one node to the next. 


ORDERED TREES 


Here is the recursive definition of an ordered tree: 


An ordered tree is either the empty set or a pair T= (7, 8S), where r is a node 
and S is a sequence of disjoint ordered trees, none of which contains 7. 


The node ¢ is called the root of the tree 7; and the elements of the sequence S are its subtrees. The 
sequence S of course may be empty, in which case 7 is a singleton. The restriction that none of 
the subtrees contains the root applies recursively: x cannot be in any subtree, or in any subtree of 
any subtree, and so on. 

Note that this definition is the same as that for unordered trees except for the facts that the 
subtrees are in a sequence instead of a set and an ordered tree may be empty. Consequently, if 
two unordered trees have the same subsets, then they are equal; but as ordered trees, they won’t 
be equal unless their equal subtrees are in the same order. Also subtrees of an ordered set may be 
empty. 


EXAMPLE 10.5 Unequal Ordered Trees 


The two trees shown in Figure 10.8 are not equal as ordered trees. 


a 


i. 2 


d d 


b 


Figure 10.8 Unequal ordered trees 


The ordered tree on the left has root node a and subtree sequence ( (b, ©), (c, (d, @) ) ). The ordered 
tree on the right has root node a and subtree sequence ( (c, (d, ©) ), (b, @) ). These two subtree 
sequences have the same elements, but not in the same order. Thus the two ordered trees are not the same. 


Strict adherence to the definition reveals a subtlety often missed, as illustrated by the next 
example. 


EXAMPLE 10.6 Unequal Ordered Trees 


a 
The two trees T, = (a, (B, C)) and T, = (a, (B, @, C)) are not the same sf a 
ordered trees, even though they would probably both be drawn the same, as b Cc 


shown in Figure 10.9. Figure 10.9 A tree 
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All the terminology for unordered trees applies the same way to ordered trees. In addition, we 
can also refer to the first child and the last child of a node in an ordered tree. It is sometimes 
useful to think analogously of a human genealogical tree, where the children are ordered by age: 
oldest first and youngest last. 


TRAVERSAL ALGORITHMS 


A traversal algorithm is a method for processing a data structure that applies a given operation 
to each element of the structure. For example, if the operation is to print the contents of the 
element, then the traversal would print every element in the structure. The process of applying 
the operation to an element is called visiting the element. So executing the traversal algorithm 
causes each element in the structure to be visited. The order in which the elements are visited 
depends upon which traversal algorithm is used. There are three common algorithms for travers- 
ing a general tree. 

The level order traversal algorithm visits the root, then visits each element on the first level, 
then visits each element on the second level, and so forth, each time visiting all the elements on 
one level before going down to the next level. If the tree is drawn in the usual manner with its 
root at the top and leaves near the bottom, then the level order pattern is the same left-to-right 
top-to-bottom pattern that you follow to read English text. 


EXAMPLE 10.7 The Level Order Traversal 


The level order traversal of the tree shown in Figure 10.10 would visit the nodes in the following order: 
a, b, c, d, e, f, g, h, i, j, k, I, m. 


Figure 10.10 A level order traversal 


Algorithm 10.1 The Level Order Traversal of an Ordered Tree 
To traverse a nonempty ordered tree: 
1. Initialize a queue. 
Enqueue the root. 
Repeat steps 4—7 until the queue is empty. 
Dequeue node x from the queue. 
Visit x. 
Enqueue all the children of x in order. 


yy se 199 NS. 
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The preorder traversal algorithm visits the root first and then does a preorder traversal recur- 
sively to each subtree in order. 


EXAMPLE 10.8 The Preorder Traversal 


The preorder traversal of the tree shown in Figure 10.11 would visit the nodes in this order: a, 
b, e, h, i, f, c, d, g, j, k, I, m. 


Figure 10.11 A preorder traversal 


Note that the preorder traversal of a tree can be obtained by circumnavigating the tree, begin- 
ning at the root and visiting each node the first time it is encountered on the left. 


Algorithm 10.2 The Preorder Traversal of an Ordered Tree 
To traverse a nonempty ordered tree: 
1. Visit the root. 
2. Do a recursive preorder traversal of each subtree in order. 
The postorder traversal algorithm does a postorder traversal recursively to each subtree 
before visiting the root. 


EXAMPLE 10.9 The Postorder Traversal 


The postorder traversal of the tree shown in Figure 10.12 would visit the nodes in the following order: 
h, i, e, f, b, Cc, j, k, I, m, g, d, a. 


Algorithm 10.3 The Postorder Traversal of an Ordered Tree 


To traverse a nonempty ordered tree: a 
1. Do a recursive preorder traversal of x. \ 
each subtree in order. A Z ‘ 
2. Visit the root. /\ | 
Note that the level order and the preorder traversals e f g 
always visit the root of each subtree first before visit- / \ Ai IS 
ing its other nodes. The postorder traversal always ‘ aa ax 
1 1 m 


visits the root of each subtree last after visiting all of 
Figure 10.12 A tree 
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its other nodes. Also, the preorder traversal always visits the right-most node last, while the 
postorder traversal always visits the left-most node first. 

The preorder and postorder traversals are recursive. They also can be implemented iteratively 
using a stack. The level order traversal is implemented iteratively using a queue. 


Review Questions 


10.1. ~All the classes in Java form a single tree, called the Java inheritance tree. 
a. What is the size of the Java inheritance tree in Java 1.3? 
b. What is the root of the tree? 
c. What kind of node is a final class in the Java inheritance tree? 


10.2. True or false: 

The depth of a node in a tree is equal to the number of its ancestors. 

The size of a subtree is equal to the number of descendants of the root of the subtree. 
If x is a descendant of y, then the depth of x is greater than the depth of y. 

If the depth of x is greater than the depth of y, then x is a descendant of y. 

A tree is a singleton if and only if its root is a leaf. 

Every leaf of a subtree is also a leaf of its supertree. 

The root of a subtree is also the root of its supertree. 

The number of ancestors of a node equals its depth. 

If R is a subtree of S and S is a subtree of 7; then RF is a subtree of T. 

A node is a leaf if and only if it has degree 0. 

In any tree, the number of internal nodes must be less than the number of leaf nodes. 
A tree is full if and only if all of its leaves are at the same level. 

m. Every subtree of a full binary tree is full. 

n. Every subtree of a complete binary tree is complete. 


10.3. For the tree shown in Figure 10.13, find: 
a. all ancestors of node F 
b. all descendants of node F ® 
c. all nodes in the subtree rooted at F 
d. all leaf nodes 


10.4 ‘For each of the five trees shown in Figure 10.14 ® © 
on page 195, list the leaf nodes, the children of 
node C, the depth of node F, all the nodes at 
level 3, the height, and the order of the tree. © ) 


Se Fa a ee 


10.5 How many nodes are in the full tree of: 
a. order 3 and height 4? 
b. order 4 and height 3? 


c. order 10 and height 4? O 
d. order 4 and height 10? 
10.6 Give the order of visitation of the tree shown in 
Example 10.2 on page 187 using the: 
a. level order traversal 
b. preorder traversal Figure 10.13 A tree 


c. postorder traversal 
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10.7 


10.8 


10.9 


10.1 
10.2 


©) ® © 


Figure 10.14 A tree 


Which traversals always visit: 
a. the root first? 
b. the left-most node first? 
c. the root last? 
d. the right-most node last? 


The level order traversal follows the pattern as reading a page of English text: left-to-right, 
row-by-row. Which traversal algorithm follows the pattern of reading vertical columns from 
left to right? 


Which traversal algorithm is used in the call tree for the solution to Problem 9.32 on page 
184? 


Problems 


Prove Theorem 10.1 on page 187. 
Prove Theorem 10.2 on page 188. 
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10.4 
10.5 
10.6 
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10.2 


10.3 
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Prove Corollary 10.1 on page 188. 
Prove Corollary 10.2 on page 188. 
Derive the formula for the path length of a full tree of order d and height h. 


The St. Petersburg Paradox is a betting strategy that seems to guarantee a win. It can be 
applied to any binomial game in which a win or lose are equally likely on each trial and in 
which the amount bet on each trial may vary. For example, in a coin-flipping game, bettors 
may bet any number of dollars on each flip, and they will win what they bet if a head comes 
up, and they will lose what they bet if a tail comes up. The St. Petersburg strategy is to con- 
tinue playing until a head comes up, and to double your bet each time it doesn’t. For exam- 
ple, the sequence of tosses is {T, T, T, H}, then the bettor will have bet $1 and lost, then $2 and 
lost, then $4 and lost, then $8 and won, ending up with a net win of -$1 + -$2 +-$4 + $8 = 
$1. Since a head has to come up eventually, the bettor is guaranteed to win $1, no matter how 
many coin flips it takes. Draw the transition diagram for this strategy showing the bettor’s 
winnings at each stage of play. Then explain the flaw in this strategy. 


Some people play the game of craps allowing 3 to be a possible point. In this version, player 
Y wins on the first toss only if it comes up 2 or 12. Use a transition diagram to analyze this 
version of the game and compute the probability that XY wins. 


Seven coins that appear identical are to be tested to determine which one of them is counter- 
feit. The only feature that distinguishes the counterfeit coin is that it weighs less than the 
legitimate coins. The only available test is to weigh one subset of the coins against another. 
How should the subsets be chosen to find the counterfeit? (See Example 10.3 on page 188.) 


Answers to Review Questions 


In the Java inheritance tree: 

The size of the tree in Java 1.3 is 1730. 

The Object class is at the root of the tree. 

A final class is a leaf node in the Java inheritance tree. 


True. 
False: It’s one more because the root of the subtree is in the subtree but is not a descendant of 
itself. 
True 
False 
True 
True 
False 
True 
True 
True 
False 
False 
True 
True 


The leaf nodes are L, M, N, H, O, P, Q; the children of node C are G and H; node F has depth 2; 
the nodes at 3 three are L, M, N, O, P, and Q; the height of the tree is 3; the order of the tree is 4. 

b. The leaf nodes are C, E, GO, P, Q, R, and S; node C has no children; node F has depth 2; the 
nodes at level 3 are L, M, N, and O; the height of the tree is 4; the order of the tree is 4. 


Sp eoe 
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10.4 


10.5 


10.6 


10.7 


10.8 
10.9 


10.1 


10.2 


c. The leaf nodes are C, E, G J, L, N, O, P, W, Y, and Z; node C has no children; node F has depth 2; 
the nodes at level 3 are H, J, and K; the height of the tree is 9; the order of the tree is 3. 

d. The leaf nodes are GH, K, L, N, O, P, Q, R, S, and T; the only child node C has is node E; node 

F has depth 3; the nodes at level 3 are F, G, H, and J; the height of the tree is 5; the order is 5. 

The leaf nodes are D, E, L, N, P, Q, R, S, and T; node C has no children; node F has depth 1; the 

nodes at level 3 are K, L, M, N, and O; the height of the tree is 4; the order of the tree is 5. 


Me 


The ancestors of F are C andA 

The descendants of F are I, K, and L. 

The nodes in the subtree rooted at F are F, 1, K, and L. 
The leaf nodes are D, H, J, K, and L. 


(3° — 1)/2 = 121 nodes 

(44 — 1)/3 = 85 nodes 

(10° — 1)/9 = 11,111 nodes 
(4'! — 1)/3 = 1,398,101 nodes 


Level order: a, b, c, d, e, f, g, h, i, j, k, |, m, n, o. 

Preorder: a, b, e, i, j, c, f, k, g, h, |, m,n, 0, d. 

Postorder: i, j, e, b, k, f, g, |, m, n, 0, h, c, d, a. 

The level order and the preorder traversals always visit the root first. 
The postorder traversal always visits the left-most node first. 


The postorder traversal always visits the root last. 
The preorder traversal always visits the right-most node last. 


Pere pop ae Te ao oP 


The preorder traversal follows the pattern of reading by column from left to right. 


The preorder traversal is used in Problem 9.32 on page 184. 
Solutions to Problems 


Proof of Theorem 10.1 on page 187: 

If there were no path from a given node x to the root of the tree, then the definition of tree would 
be violated, because to be an element of the tree, x must be the root of some subtree. If there were 
more than one path from x back to the root, then x would be an element of more than one distinct 
subtree. That also violates the definition of tree, which requires subtrees to be disjoint. 


Proof of Theorem 10.2 on page 188: 

If the tree is empty, then its height is h = —1 and the number of nodes v = 0. In that case, the 
formula is correct: n = (d“*!—1)/(d-1) = (d*!—-1)(d-1) = (d°-1)/(d-1) = (1-1)(d-1) = 0. 

If the tree is a singleton, then its height is / = 0 and the number of nodes v = 1. In that case, the 
formula is again correct: n = (d“*!—1)/(d-1) = (d*!-1)(d-1) = (d-1)(d-1) = 1. 

Now assume that the formula is correct for any full tree of height h-1, where h = 0. Let Tbe the 
full tree of height /. Then by definition, 7 consists of a root node and a set of d subtrees. And since T 
is full, each of its d subtrees has height h-1. Therefore, by the inductive hypothesis, the number of 
nodes in each subtree is ng = (d“"")*!—1)/(d-1) = (d"—1)(d-1). Thus, the total number of nodes in T 
is 


n= 1+(d)(ns) 


ll 
= 
Q. 
fo 
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le 
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Thus, by the Principle of Mathematical Induction (see page 321), the formula must be correct for all 
full trees of any height. 
Proof of Corollary 10.1 on page 188: 

This proof is purely algebraic: 


qrti_] 
d-1 
n(d—-1) = d'*!_-] 
d'*! = n(d-1)+1 
nd—-n+1 


h+1 = log(nd-—n-+1) 


h = log ,nd-n+1)-1 


Proof of Corollary 10.2 on page 188: 


Let T be a tree of any order d and any height #. Then T can be embedded into the full tree of the 


same degree and height. That full tree has exactly poet nodes, so its subtree T has at most that 


many nodes. 


The path length of a full tree of order d and height h is 2 sha" *1_(h+1)d+1]. For example, 
the path length of the full tree on Figure 10.4 on page idgis 1d2. 


The tree diagram analysis of the St. Petersburg Paradox is shown in Figure 10.15. The flaw in this 
strategy is that there is a distinct possibility (1.e., a positive probability) that enough tails could come 
up in a row to make the required bet exceed the bettor’s stake. After n successive tails, the bettor must 
bet $2”. For example, if 20 tails come up in a row, the next bet will have to be more than a million dol- 
lars! 


Win $1 


Win $1 


Win $1 


—$7+$8 = +81 


—$7-$8 =-$15 


Figure 10.15 Analysis of the St. Petersburg Paradox 


The decision tree for the version of craps where 3 can be a point is shown in Figure 10.16. The proba- 
bility that XY wins this version is 0.5068 or 50.68 percent. 


The decision tree in Figure 10.17 shows all possible outcomes from the algorithm that solves the 7- 
coin problem. 
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7 or 11, so X wins 
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2 or 12, so Y wins 


Figure 10.16 Decision tree for a craps game 


x<J 


Figure 10.17 Decision tree for the 7-coin problem 


Binary Trees 


DEFINITIONS 


Here is the recursive definition of a binary tree: 
A binary tree is either the empty set or a triple T= (x, L, R), where x is a node and L 
and R are disjoint binary trees, neither of which contains x. 
The node x is called the root of the tree 7, and the subtrees L and R are called the /eft subtree and 


the right subtree of T rooted at x. 
Comparing this definition with 


a a 
the one on page 186, it is easy to 
see that a binary tree is just an = a 
b c b c 


ordered tree of order 2. But be 


aware that an empty left subtree is / \ 

different from an empty right d 4 
subtree. (See Example 10.5 on 

page 191.) Consequently, the two Figure 11.1 Unequal binary trees 


binary trees shown Figure 11.1 
are not the same. 
Here is an equivalent, nonrecursive definition for binary trees: 


A binary tree is an ordered tree in which every internal node has degree 2. 


In this simpler definition, the 
leaf nodes are regarded as dummy 


a a 
nodes whose only purpose is to ra x Vi “ 
define the structure of the tree. In b é b . 


node, or just the nul1 reference. 
This may seem inefficient and 
more complex, but it is usually 
easier to implement. In Figure 
11.2, the dummy leaf nodes in the tree on the right are shown as asterisks. 


applications, the internal nodes — 

would hold data, while the leaf \ j \ / \ 
nodes would be either identical d rn: rr 
empty nodes, a single empty / \ 


Figure 11.2 Equal binary trees 
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Except where noted, in this book we adhere to the first definition for binary trees. So some 
internal nodes may have only one child, either a left child or a right child. 

The definitions of the terms size, path, length of a path, depth of a node, level, height, interior 
node, ancestor, descendant, subtree, and supertree are the same for binary trees as for general 
trees. (See page 186.) 


EXAMPLE 11.1 Characteristics of a Binary Tree 


Figure 11.3 shows a binary tree of size 10 and height 3. 
Node a is its root. The path from node h to node b has 
length 2. Node b is at level 1, and node h is at level 3. b is 
an ancestor of h, and h is a descendant of b. The part in the 
shaded region is a subtree of size 6 and height 2. Its root is 
node b. 


COUNTING BINARY TREES 


Figure 11.3 A binary tree 


EXAMPLE 11.2 All the Binary Trees of Size 3 
There are five different binary trees of size n = 3, as shown in Figure 11.4. 


| a 


Figure 11.4 The five different binary trees of size 3 


Four have height 2, and the other one has height 1. 
EXAMPLE 11.3 All the Binary Trees of Size 4 


There are 14 different binary trees of size m = 4, as shown in Figure 11.5. 


or ee ee me 
5 he > Me 


Figure 11.5 The 14 different binary trees of size 4 


Ten have height 3, and the other four have height 2. 
EXAMPLE 11.4 All the Binary Trees of Size 5 


To find all the binary trees of size 5, apply the recursive definition for binary trees. If t is a binary tree 
of size 5, then it must consist of a root node together with two subtrees the sum of whose sizes equals 4. 
There are four possibilities: The left subtree contains either 4, 3, 2, 1, or 0 nodes. 

First count all the binary trees of size 5 whose left subtree has size 4. From Example 11.3, we see that 
there are 14 different possibilities for that left subtree. But for each of those 14 choices, there are no other 
options because the right subtree must be empty. Therefore, there are 14 different binary trees of size 5 
whose left subtree has size 4. 

Next, count all the binary trees of size 5 whose left subtree has size 3. From Example 11.2, we see that 
there are five different possibilities for that left subtree. But for each of those five choices, there are no 
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other options because the right subtree must be a singleton. Therefore, there are five different binary trees 
of size 5 whose left subtree has size 3. 

Next, count all the binary trees of size 5 whose left subtree has size 2. There are only two different 
possibilities for that left subtree. But for each of those two choices, we have the same two different possi- 
bilities for the right subtree because it also must have size 2. Therefore, there are 2 x 2 = 4 different binary 
trees of size 5 whose left subtree has size 2. 

By similar reasoning, we find that there are five different binary trees of size 5 whose left subtree has 
size 1, and there are 14 different binary trees of size 5 whose left subtree has size 0. Therefore, the total 
number of different binary trees of size 5 is 14+5+4+5+14=42. 


FULL BINARY TREES 


A binary tree is said to be full if all its leaves are at ~ < 
the same level and every interior node has two 
children. x ES va * 


EXAMPLE 11.5 The Full Binary Tree of Height 3 7 i‘ / \ / ‘ / ‘ 


The tree shown in Figure 11.6 is the full binary tree of 
height 3. Note that it has 15 nodes: 7 interior nodes and 8 


Figure 11.6 A full binary tree of height 3 
leaves. = bene = 


Theorem 11.1 The full binary tree of height 4 has /= 2" leaves and m = 2" — 1 internal 
nodes. 
Proof: The full binary tree of height / = 0 is a single leaf node, so it has n = 1 node, which is a 
leaf. Therefore, since 2” — 1 = 2°—-1=1-—1=0, and 2” = 2°= 1, the formulas are correct for the 
case where h = 0. More generally, let h > 0 and assume (the inductive hypothesis) that the formu- 
las are true for all full binary trees of height less than 4. Then consider a full binary tree of height 
h. Each of its subtrees has height  — 1, so we apply the formulas to them: /, = /, = 2"! and m, = 
= 2’! _ |, (These are the number of leaves in the left subtree, the number of leaves in the right 
subtree, the number of internal nodes in the left subtree, and the number of internal nodes in the 
right subtree, respectively.) Then 


1=1, + ly = 2h) + 21 = 2-21 = 9h 


and 


m=m,+m y+ 1=(2%1-1)4+ 281 -1)4+1=2-241-1=2!-1 
Therefore, by the (Second) Principle of Mathematical Induction, the formulas must be true for 
full binary trees of any height / = 0. 
By simply adding the formulas for m and /, we obtain the first corollary. 


Corollary 11.1 The full binary tree of height / has a total of m = 2"*! — 1 nodes. 


By solving the formula n = 2"! — 1 for h, we obtain this corollary: 


Corollary 11.2 The full binary tree with 7 nodes has height # = lg(m+1) — 1. 
Note that the formula in Corollary 11.2 is correct even in the special case where n = 0: The empty 
binary tree has height 4 = lg(n+1) — 1 =1g(0+1) — 1 =Ig(1)-—1=0-1=-1. 
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The next corollary applies Corollary 11.1 together with the fact that the full binary tree of 
height / has more nodes than any other binary tree of height h. 


Corollary 11.3 In any binary tree of height h, 
h+1<n<2"~-1 andllgn|]<h<n-l 
where 7 is the number of its nodes. 


IDENTITY, EQUALITY, AND ISOMORPHISM 


In a computer, two objects are identically equal if they occupy the same space in memory, so 
they have the same address. In other words, there really only one object, but with two different 
names. That meaning of equality is reflected in Java by the equality operator. If x and y are refer- 
ences to objects, then the condition (x == y) will be true only if x and y both refer to the same 
object. 

But the normal concept of equality in mathematics is that the two things have the same value. 
This distinction is handled in Java by the equals() method, defined in the Object class (see 
Chapter 4) and thus inherited by every class. As defined there, it has the same effect as the equals 
operator: x.equals(y) means x == y. But that equals() method is intended to be overridden 
in subclasses so that it will return true not only when the two objects are identically equal, but 
also when they are separate objects that are “the same” in whatever sense the class designer 
intends. For example, x.equals(y) could be defined to be true for distinct instances x and y of 
Point class if they have the same coordinates. 


EXAMPLE 11.6 Testing Equality of Strings 


1 public class TestStringEquality { 

2 static public void main(String[] args) { 
3 String x = new String(C"ABCDE") ; 

4 String y = new StringC"ABCDE") ; 

5 System.out.printIn("x = 
6 

7 

8 

9 


"+ x);3 

System.out.printIn("y = " + y); 
System.out.printiIn("(x == y) =" + (& == y))3 
System.out.printIn("x.equals(y) = " + x.equals(y)); 

} 

10 } 
The output is: 
x = ABCDE 
y = ABCDE 


(x == y) = false 
x.equals(y) = true 
Here, the two objects x and y (or, more precisely, the two objects that are referenced by the reference 
variables x and y) are different objects, occupying different memory locations, so they are not identically 
equal: (x == y) evaluates to false at line 7. But they do both have the same contents, so they are 
mathematically equal, and x.equals(y) evaluates to true at line 8. 


The distinction between identical equality and mathematical equality exists in Java only for 
reference variables (i.e., only for objects). For all variables of primitive types, the equality opera- 
tor tests for mathematical equality. 
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Data structures have both content and structure. So it is possible for two data structures to 
have equal contents (i.e., have the same contents) but be organized differently. For example, two 
arrays could both contain the three numbers 22, 44, and 88, but in different orders. 


EXAMPLE 11.7 Testing Equality of Arrays 


1 public class TestArraysEquality { 
2 public static void main(String[] args) { 
3 int[] x = { 22, 44, 88 }; 
4 int[] y = { 88, 44, 22 }; 
5 ch02.ex02.DuplicatingArrays.print(x) ; 
6 ch02.ex02.DuplicatingArrays.print(y) ; 
7 System.out.printInC"Arrays.equals(x, y) = " + Arrays.equals(x, y)); 
8 Arrays.sort(x); 
9 Arrays.sort(y); 
10 ch02.ex02.DuplicatingArrays.print(x) ; 
1 ch02.ex02.DuplicatingArrays.print(y) ; 
12 System.out.printInC"Arrays.equals(x, y) = " + Arrays.equals(x, y)); 
13 } 
14 } 
The output it: 
{22, 44, 88} 
{88, 44, 22} 
Arrays.equals(x, y) = false 
{22, 44, 88} 
{22, 44, 88} 


Arrays.equals(x, y) = true 
This shows that the java.util.Arrays.equal() method requires not only the same contents for 
arrays to be equal, but also in the same order, as would be expected. 


Equality is a weaker relation than identity: Identical objects are always equal, but equal 
objects may not be identical; they could be distinct. Equality of data structures means the same 
structure and the same contents in the same order. 

A weaker kind of reflexive relation is isomorphism. Two data structures are isomorphic if they 
have the same structure. This concept is used when the “data” part of the data structure is irrele- 
vant. 

Two arrays are isomorphic if they have the same length. 

Two trees are isomorphic if one tree can be rearranged to match the other. More formally, 7) is 
isomorphic to T, (sometimes written 7, = T,) if there is a one-to-one mapping (an isomorphism) 
between them that preserves parent-child relationship between all nodes. 


EXAMPLE 11.8 Isomorphic Trees 


As unordered trees, Tree 1 and Tree 2 in Figure 11.7 are isomorphic, but not equal. 

However, Tree 3 is not isomorphic to either of the other two trees because it has only three leaves; the 
other two trees each have four leaves:Tthat’s a different structure. That distinction leads fairly easily to a 
formal deduction that there is no isomorphism between Tree | and Tree 3. 

As ordered trees, Tree | is not isomorphic to Tree 2 because their roots’ left-most subtrees have differ- 
ent sizes. The left-most subtree in Tree 1 has three nodes, while that of Tree 2 has only two nodes. That 
distinction also leads fairly easily to a formal deduction that no isomorphism between Tree | and Tree 2 
can exist. 
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Tree 1 Tree 2 Tree 3 


Figure 11.7 Isomorphic and nonisomorphic trees 


Binary trees are ordered trees. The order of the two children at each node is part of the struc- 
ture of the binary tree. 

Binary trees are ordered trees. So any isomorphism between binary trees must preserve the 
order of each node’s children. 


EXAMPLE 11.9 Nonisomorphic Binary Trees 


Tree 1 Tree 2 


Figure 11.8 Nonisomorphic binary trees 


In Figure 11.8, Binary Tree 1 is not isomorphic to 
Binary Tree 2, for the same reason that the ordered trees 
in Example 11.8 are not isomorphic: The subtrees don’t 
all match, as ordered trees. In Tree 1, the root’s right 
child has a left child; but in Tree 1, the root’s right child 
has no (nonempty) left child. 


COMPLETE BINARY TREES 


A complete binary tree is either a full binary tree 
or one that is full except for a segment of missing 


leaves on the right side of the bottom level. aa a 


EXAMPLE 11.10 A Complete Binary Tree of Height 3 


The tree shown in Figure 11.9 is complete. It is shown together with the full binary tree from which it 
was obtained by adding five leaves on the right at level 3. 


Theorem 11.2 In a complete binary tree of height h, 
h+1<n<2_1 and h=Llg n] 
where v is the number of its nodes. 
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EXAMPLE 11.11 More Complete Binary Trees 


Figure 11.10 shows three more examples of complete binary trees. 


5% SRE Shader er 


Figure 11.10 Complete binary trees 


Complete binary trees are important because they have a simple and natural implementation 
using ordinary arrays. The natural mapping is actually defined for any binary tree: Assign the 
number | to the root; for any node, if i is its number, then assign 2i to its left child and 2i+1 to its 
right child (if they exist). This assigns a unique positive integer to each node. Then simply store 
the element at node i in a[i], where a[] is an array. 

Complete binary trees are important because of the simple way in which they can be stored in 
an array. This is achieved by assigning index numbers to the tree nodes by level, as shown in 
Figure 11.11. The beauty in this natural mapping is the simple way that it allows the array 
indexes of the children and parent of a node to be computed. 


a 
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Figure 11.11 The natural mapping of a complete binary tree 


Algorithm 11.1 The Natural Mapping of a Complete Binary Tree into an Array 
To navigate about a complete binary tree stored by its natural mapping in an array: 

1. The parent of the node stored at location i is stored at location i/2. 

2. The left child of the node stored at location i is stored at location 2i. 

3. The right child of the node stored at location 7 is stored at location 27+ 1. 
For example, node e is stored at index i = 5 in the array; its parent node b is stored at index i/2 
= 5/2 = 2, its left child node j is stored at location 2i = 2:5 = 10, and its right child node k is 
stored at index 2i+ 1=2:5+1=11. 


The use of the adjective “complete” should now be clear: The defining property for complete 
binary trees is precisely the condition that guarantees that the natural mapping will store the tree 
nodes “completely” in an array with no gaps. 
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EXAMPLE 11.12 An Incomplete Binary Tree 


Figure 11.12 shows the incomplete binary tree ra ‘S 
from Example 11.1 on page 201. The natural 


mapping of its nodes into an array leaves some gaps, 
as shown in Figure 11.13. / \. 


Note: Some authors use the term “almost yi i ( 


complete binary tree” for a complete binary tree 
and the term “complete binary tree” for a full 
binary tree. 


j 
Figure 11.12 A binary tree 
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Figure 11.13 The natural mapping of an incomplete binary tree 


BINARY TREE TRAVERSAL ALGORITHMS 


The three traversal algorithms that are used for general trees (see Chapter 10) apply to binary 
trees as well: the preorder traversal, the postorder traversal, and the level order traversal. In 
addition, binary trees support a fourth traversal algorithm: the inorder traversal. These four 
traversal algorithms are given next. 


Algorithm 11.2 The Level Order Traversal of a Binary Tree 
To traverse a nonempty binary tree: 
1. Initialize a queue. 
Enqueue the root. 
Repeat steps 4—7 until the queue is empty. 
Dequeue a node x from the queue. 
Visit x. 
Enqueue the left child of x if it exists. 
7. Enqueue the right child of x if it exists. 


OY. PS GND. 


EXAMPLE 11.13 The Level Order Traversal of a Binary Tree 


Figure 11.14 on page 207 shows how the level order traversal looks on the full binary tree of height 3. 


Figure 11.14 The level order traversal of a binary tree 
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The nodes are visited in the order A, B, C, D, E, F, GH, I, J, K, L, M, N, O. 


Algorithm 11.3. The Preorder Traversal of a Binary Tree 

To traverse a nonempty binary tree: 
1. Visit the root. 
2. Ifthe left subtree is nonempty, do a preorder traversal on it. 
3. Ifthe right subtree is nonempty, do a preorder traversal on it. 


EXAMPLE 11.14 The Preorder Traversal of a Binary Tree 


Figure 11.15 on page 208 shows the preorder traversal on the full binary tree of height 3. 


Figure 11.15 The preorder traversal of a binary tree 
The nodes are visited in the order A, B, D, H, I, E, J, K, C, F, L, M, GN, O. 
Figure 11.16 shows how the preorder traversal of a binary tree can be obtained by circumnav- 


igating the tree, beginning at the root and visiting each node the first time it is encountered on the 
left: 


Figure 11.16 The preorder traversal of a binary tree 


Algorithm 11.4 The Postorder Traversal of a Binary Tree 

To traverse a nonempty binary tree: 
1. Ifthe left subtree is nonempty, do a postorder traversal on it. 
2. Ifthe right subtree is nonempty, do a postorder traversal on it. 
3. Visit the root. 


EXAMPLE 11.15 The Postorder Traversal of a Binary Tree 


Figure 11.17 shows the postorder traversal looks on the full binary tree of height 3. 
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Figure 11.17 The postorder traversal of a binary tree 
The nodes are visited in the order H, I, D, J, K, E, B, L, M, F,N, 0, GC, A. 


The preorder traversal visits the root first and the postorder traversal visits the root last. This 
suggests a third alternative for binary trees: Visit the root in between the traversals of the two 
subtrees. That is called the inorder traversal. 


Algorithm 11.5 The Inorder Traversal of a Binary Tree 

To traverse a nonempty binary tree: 
1. Ifthe left subtree is nonempty, do a preorder traversal on it. 
2. Visit the root. 
3. Ifthe right subtree is nonempty, do a preorder traversal on it. 


EXAMPLE 11.16 The Inorder Traversal of a Binary Tree 


Figure 11.18 shows how the inorder traversal looks on the full binary tree of height 3. 


Figure 11.18 The inorder traversal of a binary tree 


The nodes are visited in the order H, D, I, B, J, E, K, A, L, F, M, C, N, G O. 


EXPRESSION TREES 


An arithmetic expression such as (5 - x)*y + 6/(x + z) is acombination of arithmetic 
operators (+, -, *, /, etc.), operands (5, x, y, 6, z, etc.), and parentheses to override the 
precedence of operations. Each expression can be represented by a unique binary tree whose 
structure is determined by the precedence of operations in the expression. Such a tree is called an 
expression tree. 
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EXAMPLE 11.17 An Expression Tree 


Figure 11.19 shows the expression tree for the 
expression (5 — x)*y + 6/(Xx + Z). 


Here is a recursive algorithm for building an 
expression tree: 


Algorithm 11.6 Build an Expression Tree 
The expression tree for a given expression can 
be built recursively from the following rules: 
1. The expression tree for a single operand is a single root node that contains 
it. 
2. If £, and FE, are expressions represented by expression trees 7, and T,, and if 
op is an operator, then the expression tree for the expression F, op EF, is the 
tree with root node containing op and subtrees 7, and 75. 


Figure 11.19 An expression tree 


An expression has three representations, depending upon which traversal algorithm is used to 
traverse its tree. The preorder traversal produces the prefix representation, the inorder traversal 
produces the infix representation, and the postorder traversal produces the postfix representation 
of the expression. The postfix representation is also called reverse Polish notation or RPN. These 
are outlined on page 109. 


EXAMPLE 11.18 The Three Representations of an Expression 


The three representations for the expression in Example 11.17 are: 
Prefix: +*-5xy/6+XZ 
Infix: 5-x*y+6/Xx+Z 
Postfix (RPN): 5x-y*6xz+/+ 


Ordinary function syntax uses the prefix representation. The expression in Example 11.17 
could be evaluated as 
sum(product(difference(5, x), y), quotient(6, sum(x, z))) 
Some scientific calculators use RPN, requiring both operands to be entered before the operator. 
The next algorithm can be applied to a postfix expression to obtain its value. 


Algorithm 11.7 Evaluating an Expression from Its Postfix Representation 

To evaluate an expression represented in postfix, scan the representation from left to right: 
1. Create a stack for operands. 

Repeat steps 3—9 until the end of representation is reached. 

Read the next token ¢ from the representation. 

If it is an operand, push its value onto the stack. 

Otherwise, do steps 6—9: 

Pop a from the stack. 

Pop b from the stack. 

Evaluate c=atb. 

Push c onto the stack. 

Return the top element on the stack. 


Se $0 O0 2 ON. Oy eS 


— 
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EXAMPLE 11.19 Evaluating an Expression from Its Postfix Representation 


Figure 11.20 shows the evaluation of the expression in Example 11.18 using 2 for x, 3 for y, and 1 for z: 


za] bl] opL] aL] 


bl2] op[+] afi] 


blo] op[+] a[2] 


ble] op[/] af3] ¢[2] 


Figure 11.20 Evaluating a postfix expression 


A BinaryTree CLASS 


Here is a class for binary trees that directly implements the recursive definition. (See 
page 200.) By extending the AbstractCollection class, it remains consistent with the Java 
Collections Framework. (See Chapter 4.) 
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EXAMPLE 11.20 A BinaryTree Class 


1 public class BinaryTree<E> extends AbstractCollection { 
2 protected E root; 

3 protected BinaryTree<E> left, right, parent; 
4 protected int size; 

5 

6 public BinaryTree() { 

7 } 

8 

9 public BinaryTree(E root) { 

10 this.root = root; 

1 size = 1; 

12 } 

13 

14 public BinaryTree(E root, BinaryTree<E> left, BinaryTree<E> right) { 
15 thisCroot); 

16 if Cleft != null) f{ 

17 this. left = left; 

18 left.parent = this; 

19 size += left.sizeQ; 

20 } 

21 if Cright != null) f{ 

22 this.right = right; 

23 right.parent = this; 

24 size += right.sizeQ; 

25 } 

26 } 

27 

28 public boolean equals(Object object) { 

29 if (object == this) { 

30 return true; 

31 } else if (!Cobject instanceof BinaryTree)) { 
32 return false; 

33 } 

34 BinaryTree that = (BinaryTree)object; 

35 return that.root.equals(this. root) 

36 && that. left.equals(this. left) 

37 && that.right.equals(this. right) 

38 && that.parent.equals(this.parent) 

39 && that.size == this.size; 

40 } 

41 

42 public int hashCode() { 

43 return root.hashCode() + left.hashCode() + right.hashCode() + size; 
44 } 

45 

46 public int sizeQ { 

47 return size; 

48 } 

49 

50 public Iterator iterator( { 

51 return new java.util.Iterator() { // anonymous inner class 
52 private boolean rootDone; 


53 private Iterator lIt, rIt; // child iterators 
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54 public boolean hasNext() { 

55 return !rootDone || TIt != null && lIt.hasNextQ) 
56 || rIt != null && rIt.hasNextQ; 
57 } 

58 public Object next(Q) { 

59 if CrootDone) { 

60 if ClIt != null && lIt.hasNextQ) f{ 

61 return 1lIt.nextQ); 

62 } 

63 if CrIt != null && rIt.hasNextQ) f{ 

64 return riIt.nextQ; 

65 } 

66 return null; 

67 } 

68 if Cleft != null) { 

69 JIt = left.iteratorQ; 

70 } 

71 if Cright != null) { 

72 rIt = right.iteratorQ; 

73 } 

74 rootDone = true; 

75 return root; 

76 } 

77 public void remove() { 

78 throw new UnsupportedOperationException() ; 
79 } 

80 33 

81 } 

82 } 


The java.util.AbstractCollection class requires the four methods that are defined here: 
equals(), hashCode(), iterator(), and size().! 

The iterator() method overrides the empty version that is defined in the AbstractCollection 
class. Its job is to build an iterator object that can traverse its BinaryTree object. To do that, it creates its 
own anonymous inner Iterator class using the Java return new construct at line 47. The body of this 
anonymous class is defined between the braces that immediately follow the invocation of the constructor 
Iterator(). Note that this block must be followed by a semicolon because it is actually the end of the 
return statement. The complete construct looks like a method definition, but it is not. It really is a 
complete class definition embedded within a return statement. 

To return an Iterator object, this anonymous class must implement the Iterator interface. (See 
page 77.) This requires definitions for the three methods 

public boolean hasNext() 
public Object next(Q) 
public void remove() 

This implementation is recursive. The hasNext() method invokes the hasNext() methods of iterators 
on the two subtrees, and the next() method invokes the next() methods of those two iterators, named 
JIt and rIt. The other local variable is a flag named rootDone that keeps track of whether the root 
object has been visited yet by the iterator. 

The hasNext() method returns true unless all three parts of the tree have been visited: the root, the 
left subtree, and the right subtree. It does that by using the 1It and rIt iterators recursively. 


1. Actually, the equals() and hashCode() methods are defined in the Object class and do not have 
to be overridden. 
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The next() method also uses the 1It and rIt iterators recursively. If the root has already been 
visited, then the iterator visits the next node in the left subtree if there are any, and otherwise visits the 
next node in the right subtree if there are any. If the root has not yet been visited, then this must be the first 
call to the iterator on that particular subtree, so it initializes the 1It and rIt iterators, sets the rootDone 
flag, and returns the root. 

The remove() method is not implemented because there is no simple way to remove an internal node 
from a binary tree. 


EXAMPLE 11.21 Testing the BinaryTree Class 


1 public class TestBinaryTree { 

2 static public void main(String[] args) { 

3 BinaryTree<String> e = new BinaryTree<String>C"E") ; 

4 BinaryTree<String> g = new BinaryTree<String>C"G") ; 

5 BinaryTree<String> h = new BinaryTree<String>C"H") ; 

6 BinaryTree<String> i = new BinaryTree<String>C"I"); 

7 BinaryTree<String> d = new BinaryTree<String>("D", null, g); 
8 BinaryTree<String> f = new BinaryTree<String>C("F", h, i); 

9 BinaryTree<String> b = new BinaryTree<String>("B", d, e); 

10 BinaryTree<String> c = new BinaryTree<String>C("C", f, null); 
1 BinaryTree<String> tree = new BinaryTree<String>C("A", b, c); 
12 System.out.printf("tree: %s", tree); 

13 } 

14 } 


The output is: 
tree: [A, B, D, G, E, C, F, H, I] 
The program creates the binary tree shown in Figure 11.21 and then indirectly invokes its toStringQ 
method that it inherits from the AbstractCollections class. 
Figure 11.21 shows two views of the same tree. The larger view shows all the details, representing each 
object reference with an arrow. 


Figure 11.21 The binary tree constructed in Example 11.21 
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By extending the AbstractCol lection class, the BinaryTree class automatically inherits 
these methods that are defined by using the iterator() and size() methods: 


public boolean isEmpty() 

public boolean contains(Object object) 

public Object[] toArray() 

public Object[] toArray(Object[] objects) 

public String toString() 

public boolean add (Object object) 

public boolean addA1l1(Collection collection) 
public void clearQ 

public boolean containsAl1(Collection collection) 
public boolean remove(Object object) 

public boolean removeAl1(Collection collection) 
public boolean retainAl1l(Collection collection) 


However, the mutating methods will throw an UnsupportedOperationException because 
they invoke other methods that are not implemented, namely the add() and the 
Iterator.remove() methods. 


EXAMPLE 11.22 Testing the contains() Method on a Binary Tree 


This example builds the same tree as the one in Example 11.21 and then tests the contains () method 
on it and its subtrees: 


1 public class TestContains { 
2 static public void main(String[] args) { 
3 BinaryTree<String> e = new BinaryTree<String>C"E"); 
4 BinaryTree<String> g = new BinaryTree<String>C("G") ; 
5 BinaryTree<String> h = new BinaryTree<String>C"H") ; 
6 BinaryTree<String> i = new BinaryTree<String>C"I"); 
7 BinaryTree<String> d = new BinaryTree<String>("D", null, g); 
8 BinaryTree<String> f = new BinaryTree<String>C("F", h, i); 
9 BinaryTree<String> b = new BinaryTree<String>C("B", d, e); 
10 BinaryTree<String> c = new BinaryTree<String>C("C", f, null); 
11 BinaryTree<String> a = new BinaryTree<String>C"A", b, c); 
12 System.out.printf("a: %s%n", a); 
13 System.out.printIn("a.contains(\"H\") = " + a.contains("H")); 
14 System.out.printf("b: %s%n", b); 
15 System.out.printIn("b.contains(\"H\") = " + b.contains("H")); 
16 System.out.printf("c: %s%n", Cc); 
17 System.out.printIn("c.contains(\"H\") = " + c.contains("H")); 
18 } 
19 } 
The output is: 
a: [A, B, D, G, E, C, F, H, I] 
a.contains("H") = true 
b: [B, D, G, E] y © 
b.contains("H") = false 
Cc: [C, F, H, T] a \) 7 j 
c.contains("H") = true / 


The subtrees b and c are shown in Figure 11.22. | 
The tree a contains the element H. The subtree b | o Y X 
does not contain the element H. The subtree c does \ G df ‘eg 
contain the element H. 2 


Figure 11.22 


216 BINARY TREES [CHAP. 11 


IMPLEMENTATIONS OF THE TRAVERSAL ALGORITHMS 


The iterator that is returned by the iterator() method follows the preorder traversal 
algorithm (Algorithm 11.3 on page 208) to traverse the binary tree. The following modification 
of the BinaryTree class implements all four of the binary tree traversal algorithms. 


EXAMPLE 11.23 Implementing the Four Traversal Algorithms 


1 public class BinaryTree<E> extends AbstractCollection { 
2 // insert lines 2-49 from Example 11.20 on page 212 
50 public Iterator iteratorQ { 

51 return new PreOrder(); 

52 } 

53 

54 abstract public class BinaryTreeIterator implements Iterator { 
55 protected boolean rootDone; 

56 protected Iterator lIt, rit; // child iterators 
57 public boolean hasNext() { 

58 return !rootDone || TIt != null && 1It.hasNextQ) 
59 || rIt != null && rIt.hasNextQ; 
60 } 

61 abstract public Object nextQ); 

62 public void remove() { 

63 throw new UnsupportedOperationException() ; 

64 } 

65 } 

66 

67 public class PreOrder extends BinaryTreelIterator { 
68 public PreOrder() { 

69 if Cleft != null) f{ 

70 JIt = left.new PreOrder(); 

71 } 

72 if Cright != null) { 

73 rIt = right.new PreOrderQ; 

74 } 

75 } 

76 public Object nextQ) { 

77 if C!rootDone) { 

78 rootDone = true; 

79 return root; 

80 } 

81 if CIt != null && lIt.hasNextQO) { 

82 return lIt.nextQ; 

83 } 

84 if CrIt != null && rIt.hasNextQ) { 

85 return rit.nextQ; 

86 

87 return null; 

88 } 

89 } 

90 

91 public class InOrder extends BinaryTreelterator { 

92 public InOrderQ® { 

93 if Cleft != null) f{ 

94 JIt = left.new InOrderQ; 


95 } 
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96 

97 

98 

99 
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103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 


if Cright != null) { 
rIt = right.new InOrderQ; 
} 


} 
public Object nextQ) { 
if CIt != null && lIt.hasNextQO) { 
return lIt.nextQ; 
} 
if C!rootDone) { 
rootDone = true; 
return root; 


} 
if CrIt != null && rIt.hasNextQ) { 
return rit.nextQ; 
} 
return null; 
} 
} 


public class PostOrder extends BinaryTreelterator { 
public PostOrderQ { 
if Cleft != null) f{ 
JIt = left.new PostOrder(); 
} 
if Cright != null) f{ 
rIt = right.new PostOrderQ; 


} 


} 
public Object nextQ) { 
if CIt != null && lIt.hasNextQO) { 
return lIt.nextQ; 


} 
if CrIt != null && rIt.hasNextQ) { 
return rit.nextQ; 


if C!rootDone) { 
rootDone = true; 
return root; 
} 
return null; 
} 
} 


public class LevelOrder extends BinaryTreelIterator { 
Queue<BinaryTree<E>> queue = new ArrayDeque<BinaryTree<E>>() ; 
public boolean hasNext() { 
return (!rootDone || !queue.isEmpty()); 


} 
public Object nextQ) { 
if C!rootDone) { 
if Cleft != null) { 
queue. add(left) ; 
} 
if Cright != null) { 
queue.add(right) ; 
} 
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152 rootDone = true; 

153 return root; 

154 } 

155 if (!queue.isEmpty()) f 

156 BinaryTree<E> tree = queue.remove() ; 
157 if (tree.left != null) { 
158 queue. add(tree. left) ; 
159 } 

160 if (tree.right != null) { 
161 queue.add(tree. right) ; 
162 } 

163 return tree.root; 

164 } 

165 return null; 

166 } 

167 } 

168 } 


At line 64 we define an abstract inner class named BinaryTreeIterator. This serves as a base class 
for all four of the concrete iterator classes. It declares the same three fields (rootDone, rIt, and 1It) as 
the anonymous iterator class defined previously. 

The hasNext() and remove() methods are implemented (at lines 57 and 62) the same way the abstract 
Iterator class was done in the anonymous iterator class. But the next() method is declared abstract 
because each of the four traversal algorithms has a different implementation of it. 

The PreOrder class defines the 1It and rIt iterators to be PreOrder iterators in its constructor to 
ensure that the recursive traversal follows the preorder traversal algorithm. That algorithm (Algorithm 
11.3 on page 208) says to visit the root first, and then apply the same algorithm recursively to the left 
subtree and then to the right subtree. The three if statements do that at lies 77-86. The only differences 
between the PreOrder, InOrder, and PostOrder classes are their definitions of the recursive rIt and 
1It iterators in the constructors and the order of those three if statements in the next () methods. For the 
InOrder class, the order visits the root between the two recursive traversals. For the PostOrder class, the 
order visits the root after the two recursive traversals. (“Pre” means before, “in” means between, and 
“post” means after.) 

The LevelOrder traversal class is significantly different from the other three. Instead of being recur- 
Sive, it uses a queue. (See Algorithm 11.5 on page 209.) 


EXAMPLE 11.24 Testing the Traversal Algorithms 


1 public class TestIterators { 

2 public static void main(String[] args) { 

3 BinaryTree<String> e = new BinaryTree<String>C"E") ; 

4 BinaryTree<String> g = new BinaryTree<String>("G") ; 

5 BinaryTree<String> h = new BinaryTree<String>C"H") ; 

6 BinaryTree<String> i = new BinaryTree<String>C"I") ; 

7 BinaryTree<String> d = new BinaryTree<String>("D",null,g); 
8 BinaryTree<String> f = new BinaryTree<String>C"F",h,i); 

9 BinaryTree<String> b = new BinaryTree<String>("B",d,e); 

10 BinaryTree<String> c = new BinaryTree<String>("C",f,nul1); 
TT BinaryTree<String> tree = new BinaryTree<String>C("A",b,c); 
12 System.out.printIn(C"tree = " + tree); 

13 java.util.Iterator it; 

14 System.out.print("PreOrder Traversal: die 

15 for (it = tree.new PreOrder(); it.hasNextQ); ) f{ 

16 System.out.printCit.nextQ +" "); 


17 } 


18 System.out.printC"\nInOrder Traversal: "); 
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19 for (it = tree.new InOrder(Q); it.hasNextQ; ) { 
20 System.out.printCit.nextQ +" "); 
21 } 
22 System.out.print("\nPostOrder Traversal: "); 
23 for (it = tree.new PostOrder(); it.hasNextQ); ) { 
24 System.out.printCit.nextQ +" "); 
25 } 
26 System.out.printC"\nLevelOrder Traversal: "); 
27 for (it = tree.new LevelOrder(Q); it.hasNextQ); ) { 
28 System.out.printCit.nextQ +" "); 
29 } 
30 System.out.printInQ); 
31 } 
32 } 
The output is: 
tree = [A, B, D, G, E, C, F, H, I] 
PreOrder Traversal: ABDGECFHI 
InOrder Traversal: DGBEAHFIC 
PostOrder Traversal: GDEBHIFCA 
LevelOrder Traversal: ABCDEFGHI 
Each of the four iterators traverses the tree according to the algorithm that it implements. 


FORESTS 


A forest is a sequence of disjoint ordered trees. 
EXAMPLE 11.25 A Forest 


Figure 11.23 shows a forest that consists of three trees. 


(A) (N)) @ 
Bm (1). © @ ® ® 
CRGRORORO© S) ® 


© O 


Figure 11.23 A forest 


The following algorithm shows how a forest can be represented by a single binary tree. 


Algorithm 11.8 The Natural Mapping of a Forest into a Binary Tree 
1. Map the root of the first tree into the root of the binary tree. 
2. If node X maps into X' and node Y is the first child of X, then map Y into the left 
child of X'. 
3. If node X maps into X' and node Z is the sibling of X, then map Z into the right 
child of X'. The roots of the trees themselves are considered siblings. 
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EXAMPLE 11.26 Mapping a Forest into a Binary Tree 


Figure 11.24 is the mapping of the forest shown in Example 11.25. For example, in the original forest, C 
has oldest child F and next sibling D. In the corresponding binary tree, C has left child F and right child D. 


(A) 


> BG ©—-O © 


© ©-@-H-O O-@® 


K)-O-™) 


Figure 11.24 The natural mapping of a forest into a binary tree 


Review Questions 


11.1. How many leaf nodes does the full binary tree of height / = 3 have? 

11.2. How many internal nodes does the full binary tree of height 4 = 3 have? 
11.3. How many nodes does the full binary tree of height / = 3 have? 

11.4 How many leaf nodes does a full binary tree of height ; = 9 have? 

11.5. How many internal nodes does a full binary tree of height 4 = 9 have? 
11.6 How many nodes does a full binary tree of height / = 9 have? 

11.7. What is the range of possible heights of a binary tree with n = 100 nodes? 
11.8 Why is there no inorder traversal for general trees? 


11.9 True or false: 
a. If all of its leaves are at the same level, then the binary tree is full. 
b. If the binary tree has 1 nodes and height h, then h > Lig. 
c. A binary tree cannot have more than 24 nodes at depth d. 
d. If every proper subtree of a binary tree is full, then the tree itself must also be full. 


Problems 


11.1. For each of the binary trees in Figure 11.25 on page 221, draw the equivalent version that sat- 
isfies the second definition, namely that every internal node has two children. 


11.2. Give the order of visitation of the binary tree shown in Figure 11.26 using the specified tra- 
versal algorithm: 
a. the level order traversal 
b. the preorder traversal 
c. the inorder traversal 
d. the postorder traversal 
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Figure 11.25 Binary trees 


(A) 
(B) C) 
QO ® © © 
Oe ORC 


Figure 11.26 A binary tree 


Figure 11.27 A binary tree 


11.3. Give the order of visitation of the binary tree of size 10 shown in Example 11.1 on page 201 
using: 
a. the level order traversal 
b. the preorder traversal 
c. the inorder traversal 
d. the postorder traversal 
11.4 — Give the order of visitation of the binary tree shown in Figure 11.27 using: 
a. the level order traversal 
b. the preorder traversal 
c. the inorder traversal 
d. the postorder traversal 
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11.9 

11.10 
11.11 
11.12 
11.13 


11.14 
11.15 
11.16 
11.17 
11.18 


11.19 


11.20 
11.21 
11.22 


11.23 


11.24 
11.25 
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Show the array that is obtained by using the natural mapping to store the binary tree shown in 
Problem 11.1. 


Show the array that is obtained by using the natural mapping to store the binary tree shown in 
Example 11.1 on page 201. 


Show the array that is obtained by using the natural mapping to store the binary tree shown in 
Problem 11.4. 


If the nodes of a binary tree are numbered according to their natural mapping, and the visit 
operation prints the node’s number, which traversal algorithm will print the numbers in 
order? 


Draw the expression tree for a*(b + c)*(d*e + f). 

Write the prefix and the postfix representations for the expression in Problem 11.8. 

Draw the expression tree for each of the prefix expressions given in Problem 5.2 on page 111. 
Draw the expression tree for each of the infix expressions given in Problem 5.4 on page 111. 


Draw the expression tree for each of the postfix expressions given in Problem 5.6 on page 
111. 


Draw the expression tree for the expression a*(b + c)*(d*e + f). 

What are the bounds on the number x of nodes in a binary tree of height 4? 
What are the bounds on the height / of a binary tree with 7 nodes? 

What form does the highest binary tree have for a given number of nodes? 


What form does the lowest binary tree (i.e., the least height) have for a given number of 
nodes? 


Verify the recursive definition of binary trees (page 200) for 
the binary tree shown in Figure 11.28. 


Draw all 42 binary trees of size n = 5. 


How many different binary trees of size n = 6 are there? 


Derive a recurrence relation for the number f(7) of binary trees 


of size n Figure 11.28 A binary tree 


Show that, for all  < 8, the function f() derived in Problem 11.22 produces the same 


sequence as the following explicit formula 


2n 
& _ _Qn)! (Qn)(2n—-1)(2n—2).--Qn+3)(2Qn+ 2) 
n+l nl(n+1)! (n)(n—-1)(n—-2)(n-3)---(2)C1) 


fin) = 
For example, 


() 
A BN os ABV NO) OID) 
ame aa 15 (4)(3)(2)C1) 4 mt 


Prove Corollary 11.3 on page 203. 
Prove Theorem 11.2 on page 205. 
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11.26 


11.27 


11.28 


11.29 


11.9 


Draw the forest that is represented by the binary tree 
shown in Figure 11.29. 


Derive an explicit formula for the number f(h) of 
complete binary trees of height h. 


Derive an explicit formula for the number f(/) of full 
binary trees of height h. 


Implement the each of the following methods for the 
BinaryTree class: Figure 11.29 A binary tree 
a. public int leaves(); 
// returns the number of leaves in this tree 
b. public int height(); 
// returns the height of this tree 
Cc. public int level(Object object); 
// returns -1 if the given object is not in this tree; 
// otherwise, returns its level in this tree; 
d. public void reflect(Q); 
// swaps the children of each node in this tree 
e. public void defoliateQ; 
// removes all the leaves from this tree 


Answers to Review Questions 


The full binary tree of height 3 has / = 2? = 8 leaves. 

The full binary tree of height 3 has m = 2? — 1 = 7 internal nodes. 

The full binary tree of height 3 has n = 27*! — 1 =24-1= 16-1 =15 nodes. 

The full binary tree of height 9 has / = 2° = 512 leaves. 

The full binary tree of height 9 has m = 2? — 1 = 512 — 1 = 511 internal nodes. 

The full binary tree of height 9 has n = 2°*! — 1 = 2!°— 1 = 1024 — 1 = 1023 nodes. 


By Corollary 11.3, in any binary tree: Llg 2] < h < n-1. Thus in a binary tree with 100 nodes Lig 
100] < h < 100-1 = 99. Since Llg 100] =|(1og100)/(log2) | = 6.6] = 6, it follows that the height 
must be between 6 and 99, inclusive: 6<h< 99. 


The inorder traversal algorithm for binary trees recursively visits the root in between traversing the left 
and right subtrees. This presumes the existence of exactly two (possibly empty) subtrees at every 
(nonempty) node. In general trees, a node may have any number of subtrees, so there is no simple 
algorithmic way to generalize the inorder traversal. 


a. True 
b. True 
c. True 
d. False 


Solutions to Problems 


The equivalent trees are shown in Figure 11.30. 
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Figure 11.30 Binary trees 


11.2.‘ The order of visitation in the binary tree traversal: 
a. Level order: A, B, C, D, E, F, GH, I, J, K 
b. Preorder: A, B, D, E, H, I, C, F, J, G K 
c. Inorder: D, B, H, E, I, A, F, J, C, GK 
d. Postorder: D,H, I, E,B, J,F,K,GCA 


11.3. The order of visitation in the binary tree traversal: 
a. Level order traversal: A, B, C, D, E, F, H, I, J,M 
b. Preorder traversal: A, B,D, H, I, E, J, C, F,M 
c. Inorder traversal: H, D, I, B, J, E, A, F, M, C 
d. Postorder traversal: H, I, D, J, E, B, M, F,C,A 
11.4 =‘ The order of visitation in the binary tree traversal: 
a. Level order traversal: A, B, C, D, E, F, GH, J, K, L, M 
b. Preorder traversal: A, B, D, G M, H, C, E, J, N, F, K, 0, 
c. Inorder traversal: G M, D, H, B, A, N, J, E, C, K, 0, F, L 
d. Postorder traversal: M, GH, D, B, N, J, E,0,K,L,F,C,A 
11.5 =‘ The natural mapping of the specified binary tree is shown in Figure 11.31. 


10 


ATB|CID|E [FG] | [H|Z| [3] |K 
Figure 11.31 An array 


11.6 The natural mapping of the specified binary tree is shown in Figure 11.32. 


il 12 


A|B|C|D[E|F| |H|1|J M 


Figure 11.32 An array 
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11.7.‘ The natural mapping of the specified binary tree is shown in Figure 11.33. 


il 12 13 i4 15 16 17 18 19 20 


A B C D 5 : c C H 10 j K L M 2 21 22 23 N ie] ho 


Figure 11.33 An array 


11.8 = The level order traversal will print the numbers from the natural mapping in order. 
11.9 The expression tree for a*(b + c)*(d*e + f) is shown in Figure 11.34. 


va \ 
/, 
eS 
/ \ 
Figure 11.34 A binary tree 
11.10 The prefix expression is *a*+bhc+*def. The postfix expression is *abc+de*f+**. 


11.11 Figure 11.35 shows the expression tree for each of the prefix expressions given in Problem 5.2 on page 
111. 


a. - b. / c. / 
| ra \ . PA \. ge ~< | 
"aii 
A x fs Be Oe Va \ 

+ a /\ b * 
/\ ai a 
ab de 
Figure 11.35 Prefix expression trees 
11.12 Figure 11.36 shows the expression tree for each of the infix expressions given in Problem 5.4 on page 

111. 
a. rae b. 7 Cc. /\ 
+ / a * / - 
a b c + * - a / da e 
de becde be 


Figure 11.36 Infix expression trees 
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11.13 


11.14 


11.15 
11.16 
11.17 


11.18 


11.19 


11.20 
11.21 
11.22 
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Figure 11.37 shows the expression tree for each of the postfix expressions given in Problem 5.6 on 
page 111. 


7s £% 4 
7% pe * | a 


\ /\ /\ [\ 


a bab bcde a 
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/\ 

de 


Figure 11.37 Postfix expression trees 


Figure 11.38 shows the expression tree for 
a*(b + c)*(d*e +f) is: 


In a binary tree of heighth =4,5 <n 31. 
In a binary tree with n = 7 nodes, 2 <h<6. 


For a given number of nodes, the highest 
binary tree is a linear sequence. 


For a given number of nodes, the lowest 
binary tree is a complete binary tree. 


To verify the recursive definition for the 
given tree, we first note that the leaves C, E, 
and F are binary trees because every single- 
ton satisfies the recursive definition for binary trees because its left and right subtrees are both empty 
(and therefore binary trees). Next, it follows that the subtree rooted at B is a binary tree because it is a 
triplet (X,L,R) where XY = B, L = ©, and R = C. Similarly, it follows that the subtree rooted at D is a 
binary tree because it is a triplet (X,L,R) where X = D, L = E, and R = F. Finally, it follows that the 
entire tree satisfies the recursive definition because it is a triplet (X,L,R) where X = A, L is the binary 
tree rooted at B, and L is the binary tree rooted at D. 


Figure 11.38 An expression tree 


Figure 11.39 on page 227 shows all 42 different binary trees of size n = 5. 
There are 132 different binary trees of size 6: 1-42 + 1-14+2-54+5-2+ 14-1+ 42-1 = 132. 


A nonempty binary tree consists of a root X, a left subtree Z, and a right subtree R. Let be the size of 
the binary tree, let n, = |L| = the size of L, and nz = |R| = the size of R. Thenn = 1 +n, + nz. So there 
are only n different possible values for the pair (7; , np): (0, m—1), (1, m2), ..., (n—1,0). For example, if 
n = 6 (as in Problem 11.21), the only possibilities are (0,5), (1,4), (2,3), (3,2), (4,1), or (5,0). In the 
(0, n—1) case, ZL is empty and |R| = n—1; there are /(0)-f(n—-1) different binary trees in that case. In the 
(1, -2) case, L is a singleton and |R| = n—2; there are f(1)-/(7—2) different binary trees in that case. The 
same principle applies to each case. Therefore the total number of different binary trees of size 7 is 
f(r) = 1-f(n-1) + 1-f(n-2) + 2-f(n-3) + 5-f(n-4) + 14-f(n—5) + + + fCi-l)-f(n-i) + + f(n-1)1 


In closed form, the formula is 


fn) = AG-V) fri) 
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Figure 11.39 The 42 binary trees of size 5 


11.23 These are called the Catalan numbers: 


n 2) n+l (2") Y fi- VD) fla i) 
n n 
(n+1) 
0 1 1 1) 1 
1 2. 2 1) 11=1 
2 6 3 2) 1-14+1-1=2 
3 20 4 5 | 1-2+1-142-1=5 
4 70 5 14 | 1-5+1-2+2-1+5-1 = 14 
5 | 252 6 42 | 1-144+1-5+2-2+5-1+14-1 = 42 
6| 924 7 132 | 1-42+1-14+2-5+5-2+14-1+42-1 = 132 
7 | 3432 8 429 | 1-1324+1-42+2-14+5-5+14-2+42-1+132-1 = 429 


Table 11.1 Catalan numbers 


11.24 Fora given height h > 0, the binary tree with the most nodes is the full binary tree. Corollary 11.1 on 
page 202 states that that number is n = 2"! _ 1. Therefore, in any binary tree of height /, the number 
n of nodes must satisfy n < of 1 The binary tree with the fewest nodes for a given height / is the 
one in which every internal node has only one child; that linear tree has n = h + 1 nodes because every 
node except the single leaf has exactly one child. Therefore, in any binary tree of height /, the number 
n of nodes must satisfy n = h + 1. The second pair of inequalities follows from the first by solving for 
h. 


11.25 Let 7 be any binary tree of height / and size n. Let T, be the smallest complete binary tree that con- 
tains T. Let h, be the height of 7, and let 7, be its size. Then h = h, and n <n,. Then by Corollary 11.1 
on page 202,n <n, = Ql 1) The required inequalities follow from this result. 
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11.26 Figure 11.40 shows how the forest that produced the specified binary tree was obtained by reversing 
the natural map. 


11.27 fh)=h+1 
11.28 fh)=1 
11.29 a. 


b. 


c. 


d. 


Figure 11.40 Mapping a forest into a binary tree 


public int leaves() { 


} 


if (this == null) { 

return 0; 
} 
int leftLeaves = (left==null ? 0: left.leavesQ)); 
int rightLeaves = (right==null ? 0 : right. leaves()); 
return leftLeaves + rightLeaves; 


public int heightQ { 


i 


if (this == null) { 
return -1; 
} 
int leftHeight = (left==null ? -1 : left.heightQ); 
int rightHeight = Cright==null ? -1 : right.heightQ); 
return 1 + CleftHeight<rightHeight ? rightHeight : leftHeight); 


public int level(Object object) { 


} 


if (this == null) { 


return -1; 
} else if (object == root) { 
return 0; 


} 
int leftLevel = (left==null ? -1 : left. level (object)); 


int rightLevel = (Cright==null ? -1 : right. level (Cobject)); 
if CleftLevel < 0 && rightLevel < 0) { 
return -1; 


} 
return 1 + (leftLevel<rightLevel ? rightLevel : leftLevel); 


public void reflect() { 


if (this == null) { 
return; 

} 

if Cleft != null) { 
left.reflectQ); 

} 

if Cright != null) { 
right.reflectQ; 


} 
BinaryTree temp=left; 
left = right; 


right = temp; 
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e. public void defoliate() { 
if (this == null) { 
return; 
} else if Cleft == null && right == null) { 
root = null; 
return; 


} 

if Cleft != null && left. left==null && left.right==null) { 
left = null; 

} else { 
left.defoliate(); 


} 
if Cright != null && right. left==null && right.right==nul1) 
right = null; 
} else { 
right.defoliateQ; 
} 
} 


Search Trees 


Tree structures are used to store data because their organization renders more efficient access 
to the data. A search tree is a tree that maintains its data in some sorted order. 


MULTIWAY SEARCH TREES 


Here is the recursive definition of a multiway search tree: 


A multiway search tree of order m is either the empty set or a pair (Kk, S), 
where the first component is a sequence k = (k,, k,,..., k,_,) of n—1 keys and 
the second component is a sequence S = (S,, S|, .8,,.. .,S,,_;) of m multiway 
search trees of order m, with2 <n < m,ands, < k, < s, <... < k,, < S,, 
for each s, € S,. 


This is similar to the recursive definition of a general tree on page 186. A multiway search tree of 
order m can be regarded as a tree of order m in which the elements are sequences of keys with the 
ordering property described above. 


EXAMPLE 12.1 A Five-Way Search Tree 


Here is an m-way search tree with m = 5. It has three internal nodes of degree 5 (each containing four 
keys), three internal nodes of degree 4 (each containing three keys), four internal nodes of degree 3 (each 
containing two keys), and one internal node of degree 2 (containing one key). 


[48/50/55] 61/64/65]67| 


Figure 12.1 A five-way search tree 
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The root node has two keys and three children. All four keys in the first child are less than k, = 57. All 
three keys in the second child are between 4, = 57 and k, = 72. Both keys in the third child are greater 
than k, = 72. In fact, all thirteen keys in the first subtree are less than 57, all seven keys in the second 
subtree are between 57 and 72, and all eight keys in the third subtree are greater than 72. 


An m-way search tree is called a search tree because it serves as a multilevel index for search- 
ing large lists. To search for a key value, begin at the root and proceed down the tree until the key 
is found or a leaf is reached. At each node, perform a binary search for the key. It it is not found 
in that node, the search will stop between two adjacent key values (with kj =—© and k, = ©), 
In that case, follow the link that is between those two keys to the next node. If we reach a leaf, 
then we know that the key is not in the tree. 

For example, to search for key value 66, start at the root of the tree and then follow the middle 
link (because 57 < 66 < 72) down to the middle three-key node. Then follow its third link 
(because 60 < 66 < 70) down to the bottom four-key node. Then follow its third link (because 
65 < 66 < 67) down to that leaf node. Then conclude that the key 66 is not in the tree. 

To insert a key into an m-way search tree, first apply the search algorithm. If the search ends at 
a leaf node, then the two bracketing keys of its parent node locate the correct position for the new 
key. So insert it in that internal node between those two bracketing keys. If that insertion gives 
the node m keys (thereby exceeding the limit of m—1 keys per node), then split the node into two 
nodes after moving its middle key up to its parent node. If that move gives the parent node m 
keys, repeat the splitting process. This process can iterate all the way back up to the root, if 
necessary. Splitting the root produces a new root, thereby increasing the height of the tree by one 
level. 


EXAMPLE 12.2 Inserting into a Five-Way Tree 


To insert 66 into the search tree of Example 12.1, first perform the search, as described above. This 
leads to the leaf node marked with an X in Figure 12.2: 


Figure 12.2 Inserting 66 into a five-way search tree 


Insert the new key 66 in that last parent node between the bracketing keys 65 and 67 as shown in Figure 
12.3 on page 232. 

Now that node contains five keys, which violates the four-key limit for a five-way tree. So the node 
gets split, shifting its middle key 65 up to its parent node as shown in Figure 12.4 on page 232. 


Node splitting occurs relatively infrequently, especially if m is large. For example, if m = 50, 
then on average only 2 percent of the nodes would be full, so a bottom-level split would be 
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OOO I | 


Figure 12.3 Inserting 66 into a five-way search tree 


a) | jf 


Figure 12.4 Inserting 66 into a five-way search tree 


required for only about 2 percent of the insertions. Furthermore, a second-from-bottom-level 
split (i.e., a double split) would be required for only about 2 percent of 2 percent of the inser- 
tions, that is, with probability 0.0004. And the probability of a triple split would be 0.000008. So 
the chances of the root being split are very small. And since that is the only way that the tree can 
grow vertically, it tends to remain a very shallow, very broad tree, providing very fast search 
time. 


B-TREES 


A B-tree of order m is an m-way search tree that satisfies the following extra conditions: 
1. The root has at least two children. 
2. All other internal nodes have at least [m/2 | children. 
3. All leaf nodes are at the same level. 
These conditions make the tree more balanced (and thus more efficient), and they simplify the 
insertion and deletion algorithms. 

B-trees are used as indexes for large data sets stored on disk. In a relational database, data are 
organized in separate sequences of records called tables. Each table could be stored as a sequen- 
tial data file in which the records are numbered like the elements of an array. Or the database 
system might access the records directly by their disk addresses. Either way, each record is 
directly accessible on disk via some addressing scheme. So once we have the record’s disk 
address, we can access it immediately (1.e., with a single disk read). So the “key” that is stored in 
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the B-tree is actually a key/address pair containing the record’s actual key value (e.g., a U.S. 
Social Security number for personnel records, or an ISBN for books) together with its disk 
address. In the outline that follows, only the key value is shown, the accompanying disk address 
being understood to accompany it. 


EXAMPLE 12.3 A B-Tree 


Figure 12.5 shows a B-tree of order 5. Each of its internal nodes has 3, 4, or 5 children, and all the 
leaves are at level 3. 


Figure 12.5 A B-tree of order 5 


Algorithm 12.1 Searching in a B-Tree 
To find a record with key k using a B-tree index of order m: 

1. Ifthe tree is empty, return nul]. 

2. Let x be the root. 

3. Repeat steps 4—6 until x is a leaf node. 

4. Apply the binary search (page 31) to node x for the key k,, where k_, <k < k;, 
(regarding k, =—-© andk,,= ©). 
If k,; =k, retrieve the record from disk and return it. 

6. Let x be the root of subtree S;. 

Return nu11. 


nN 


Note how similar this process is to looking up a topic in the index of a book. Each page of the 
index is labeled with a word or letter that represents the topics listed on that page. The page 
labels are analogous to the keys in the internal nodes of the search tree. The actual page number 
listed next to the topic in the book’s index is analogous to the disk address of file name that leads 
you to the actual data. The last step of the search process is searching through that page in the 
book, or through that file on the disk. This analogy is closer if the book’s index itself had an 
index. Each internal level of the multiway tree corresponds to another index level. 


Algorithm 12.2 Inserting into a B-Tree 
To insert a record with key k using a B-tree index of order m: 
1. If the tree is empty, create a root node with two dummy leaves, insert k 
there, and return true (indicating that the insertion was successful). 
2. Let x be the root. 
3. Repeat steps 4—6 until x is a leaf node. 
4. Apply the binary search to node x for the key k,, where k;_, <k < k; (regard- 
ing k)=-© andk,,= ©). 
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If k; = k, return false (indicating that the insertion was unsuccessful 
because a record with key & already exists, and keys should be unique). 


Let x be the root of subtree S,. 
Add the record to disk. 


Insert k (with the record’s disk address) into x between k,_, and k;. 


Add a dummy leaf node to x. 


If degree(x) = m, repeat steps 11—13 until degree(x) < m. 


Let k; be the middle key in node x. 


Let u and v be the left and right halves of x after removing k, from x. 
If x is the root, create a new root node containing k, with subtrees uw and v. 
Otherwise, insert k, in x’s parent node and attach subtrees w and v. 


Return true. 


This insertion process is illustrated in Figure 12.6. 


x 


k, 23 


Figure 12.6 Inserting into a B-tree 


The deletion algorithm for B-trees is similar to the insertion algorithm. 

All three algorithms run in time proportional to the height of the tree. From Corollary 10.1 on 
page 188 it follows that that height is proportional to log,,”. From Theorem A.2 on page 320, it 
follows that that is proportional to lgv. Thus we have: 


Theorem 12.1 In a B-tree, searching, inserting, and 


deleting all run in O(lg7) time. 
BINARY SEARCH TREES 


A binary search tree is a binary tree whose elements 
include a key field of some ordinal type and which has 
this property: If k is the key value at any node, then k = x 
for every key x in the node’s left subtree and k < y for 


Figure 12.7 A binary search tree 
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every key y in the node’s right subtree. This property, called the BST property, guarantees that an 
inorder traversal of the binary search tree will produce the elements in increasing order. 
The BST property is applied for each insertion into the tree: 


Algorithm 12.3 Inserting into a binary search Tree 
To insert an element with key value k into a binary search tree: 


1. Ifthe tree is empty, insert the new element at the root. Then return. 
2. Let p locate the root. 


3. If k is less than the key stored at p and if the node at p has no left child, 
insert the new element as the left child of p. Then return. 


4. Ifk is less than the key stored at p and if the node at p has a left child, let p 
locate that left child of p. Then go back to step 3. 


5. If the node at p has no right child, insert the new element as the right child 
of p. Then return. 


6. Let p locate the right child of p. Then go back to step 3. 
EXAMPLE 12.4 Inserting into a Binary Search Tree 


Apply Algorithm 12.3 to insert an element with key M into the binary search tree shown in Figure 12.7. 

Step | starts the iterator p at the root K. Since M is 
greater than K (i.e., it follows it lexicographically) and 
node K has a right child, the algorithm proceeds to step 
6, resetting the iterator p to node P, and then goes back 
to step 3. Next, since M is less than P (i.e., it precedes it 
lexicographically) and node P has a left child, the 
algorithm proceeds to step 4, resetting the iterator p to 
node N, and then goes back to step 3. Next, since M is 
also less than N but node N has no left child, the 
algorithm proceeds to step 5, inserts the new element as 
the left child of node N, and then returns. 

This is illustrated in Figure 12.8. 


Figure 12.8 A binary search tree 


EXAMPLE 12.5 Building a Binary Search Tree 


Figure 12.9 on page 236 shows the binary search tree that is built by inserting the input sequence 44, 
22, 77, 55, 99, 88, 33. 


If a binary search tree is balanced, it allows for very efficient searching. As with the binary 
search, it takes O(lgv) steps to find an element in a balanced binary search tree. But without 
further restrictions, a binary search tree may grow to be very unbalanced. The worst case is when 
the elements are inserted in sorted order. In that case the tree degrades to a linear list, thereby 
making the search algorithm an O(n”) sequential search. 


EXAMPLE 12.6 An Unbalanced Binary Search Tree 


This is the same input data as in Example 12.5, but in a different order: 99, 22, 88, 33, 77, 55, 44. The 
resulting binary search tree is shown in Figure 12.10 on page 236. 

This shows that the same input in different order produces a different tree. But more important, it 
shows that it is not unlikely for the binary search tree to be linear, or nearly linear. 
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Figure 12.9 Inserting into a binary search tree 


PERFORMANCE OF BINARY SEARCH TREES 


Both the insert() and the search() functions begin at the root of 
the tree and proceed down toward the leaves, making one comparison at 
each level of the tree. Therefore the time required to execute either 
algorithm is proportional to ) + 1, where h is the height of the tree. The 
search() function may terminate before reaching a leaf, but / + 1 is 
still an upper bound on the number of comparisons that it can make. 


Theorem 12.2 In a binary search tree of size n, the insert() and 
the search() functions each require O(lgm) comparisons in the best 
case. 

In the best case, the binary tree is completely balanced and nearly 
full, so by Corollary 11.2 on page 202, h = lg(m+1) — 1 = O(ign). 


Theorem 12.3 In a binary search tree of size n, the insert() and 
the search() functions each require O(”) comparisons in the worst 
case. 

In the worst case the tree is linear, soh + 1 =n = O(n). 


Figure 12.10 A BST 


Theorem 12.4 In a binary search tree of size n, the insertQ and the search() functions 


each require O(2Inn) ~ O(1.39lgn) comparisons in the average case. 
The proof of this result is beyond the scope of this outline. 
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AVL TREES A 


The imbalance problem illustrated in B 
Example 12.6 can be avoided by imposing sf 
balance constraints on the nodes of the D 
binary search tree. \ 

Define the balance number at any node 
of a tree to be the difference between the 0 9 
height of its left subtree and the height of 
its right subtree. An AVL tree is a binary 


search tree where the balance number at Figure 12.11 An AVL tree 


each node is either —1, 0, or 1. The name 
comes from the two inventors of this 
method: G.M. Adel’son-Velskii and Y.M. Landis. 
The tree in Figure 12.12 is not an AVL A 


tree because it is imbalanced at node C. we os 
B 


Its balance number there is 2, which is 


5 ¢ 
outside the allowable range. It is also fo : SS of ¥ Se 
imbalanced at node G The tree in Figure a 


12.11 is an AVL tree: Every balance 2 ¥ 
number is either —1, 0 or 1. A ‘ / a 
H F g & 


EXAMPLE 12.7 An AVLTree CLASS 


This class for AVL trees extends the 
BinaryTree class defined in Example 11.20 Figure 12.12 Not an AVL tree 
on page 212: 
public class AVLTree extends BinaryTree { 


1 

2 protected AVLTree left, right; 

3 protected int balance; 

4 protected java.util.Comparator comp; 
5 

6 public AVLTree(java.util.Comparator comp) { 
7 this.comp = comp; 

8 } 

9 

10 public AVLTree(Object root, java.util.Comparator comp) { 
1 this.root = root; 

12 this.comp = comp; 

13 } 

14 

15 public boolean add(Object object) { 
16 AVLTree temp = attach(object) ; 

17 if (temp != this) { 

18 left = temp. left; 

19 right = temp.right; 

20 balance = temp.balance; 

21 } 

22 return true; 


23 } 


2a 
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25 public AVLTree attach(Object object) { 
26 if (root == null) { // tree is empty 
27 root = object; 

28 return this; 

29 } 

30 if Ccomp.compare(object,root) < 0) { // insert into left subtree 
31 if Cleft == null) f{ 

32 left = new AVLTree(object, comp) ; 
33 ++51zZe; 

34 --balance; 

35 t+ else { 

36 int Ib = left.balance; 

37 left = left.attach(object) ; 

38 if Cleft.balance != Ib && left.balance != 0) { 
39 --balance; 

40 } 

a } 

42 if (balance < -1) { 

43 if Cleft.balance > 0) { 

44 left = left.rotateLeftQ; 

45 } 

46 return rotateRightQ; 

47 

48 } else { // insert into right subtree 
49 if Cright == null) { 

50 right = new AVLTree(object, comp) ; 
51 ++51zZe; 

52 ++balance; 

53 t+ else { 

54 int rb = right.balance; 

55 right = right.attach(object) ; 

56 if Cright.balance != rb && right.balance != 0) { 
57 ++balance; 

58 } 

59 } 

60 if (balance > 1) f{ 

61 if Cright.balance < 0) { 

62 right = right.rotateRightQ) ; 

63 } 

64 return rotateLeftQ; 

65 } 

66 } 

67 return this; 

68 } 

69 

70 private AVLTree rotateRight() // see Problem 12.5 on page 240 
71 

72 private AVLTree rotateLeftQ) f{ 

73 AVLTree x = this, y = right, z = y. left; 
74 x.right = Zz; 

75 y. left = x; 

76 int xb = x.balance, yb = y.balance; 

77 if (yb < 1) f 

78 --x. balance; 


79 y.balance = ( xb>0 ? yb-1 : xb+yb-2 ); 
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80 3} else if (yb < xb) { 

81 x.balance -= yb+1; --y.balance; 
82 } else { 

83 y.balance = xb-2; 

84 } 

85 return y; 

86 } 

87 } 


EXAMPLE 12.8 Building an AVL Tree 


Insertions of G, M, T, D, and P into an empty AVL tree are shown in Figure 12.13. 


ee 


Figure 12.13 Inserting into an AVL tree 


The first rotation occurs with the insertion of T. That increases the balance at the root to 2, which 
violates the AVL constraint. The left rotation about the root x makes M become the parent of its prior 
parent G 

The next rotation occurs after E in inserted. The right rotation at its parent D straightens out the dogleg 
G — D — E but leaves the balance at G at —2. That requires a second rotation in the opposite direction. 
Double rotations like this are required when the imbalance is at the top of a dogleg. 

Note how efficient the rotations are. By making only local changes to references and balance numbers, 
they restore the tree to nearly perfect balance. 

Figure 12.14 on page 240 shows a later insertion into the same AVL tree, inserting W after U, V, and Z 
have been inserted. 

This illustrates a double rotation where a nontrivial subtree gets shifted. The subtree containing U is 
shifted from parent V to parent T. Note that the BST property is maintained. 
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Figure 12.14 AVL tree rotations 


Although a bit complicated, the insertion algorithm for AVL trees is very efficient. The 
rotations that keep it balanced make only local changes to a few references and balance numbers. 
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Review Questions 


What are the advantages and disadvantages of using a binary search tree? 


What are the advantages and disadvantages of using an AVL tree? 


Problems 


Describe what happens in the five-way tree shown in Example 12.1 on page 230 when a new 
record with key 16 is inserted. 


Find two other orderings of the seven keys in Example 12.5 on page 235 that will produce the 
same binary search tree. 


Describe a method for sorting arrays of objects using a binary search tree. Then determine 
the complexity of the algorithm. 


Determine which of the binary trees shown in Figure 12.15 on page 241 is a binary search 
tree. 


Write the rotateRight() method for the AVLTree class. 
Prove that every subtree of a binary search tree is also a binary search tree. 
Prove that every subtree of an AVL tree is also an AVL tree. 


Here are the U.S. Postal Service abbreviations of the first 10 states, in the order that they rat- 
ified the U.S. Constitution: DE, PA, NJ, GA, CT, MA, MD, SC, NH, VA. Show the AVL tree 
after the insertion of each of these strings. 
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12.1 


12.2 


12.1 


12.2 


12.3 


12.4 


Figure 12.15 Binary trees 


Answers to Review Questions 


The disadvantage of a binary search tree is that it may become very unbalanced, in which case search- 
ing degenerates into an O(n) algorithm. The advantage is the efficiency that a binary tree provides for 
insertions and deletions. 


The advantage of an AVL tree is that it is always balanced, guaranteeing the O(|gn) speed of the binary 
search algorithm. The disadvantages the complex rotations used by the insertion and removal algo- 
rithms needed to maintain the tree’s balance. 


Solutions to Problems 


To insert a new record with key 16 into the tree shown in Figure 12.16, the initial search would lead to 
the first leaf node. Since that is a five-way search tree, that first leaf node has overflowed, causing it to 
be split into two leaf nodes and moving its middle key 19 up to its parent node, as shown in Figure 
12.17. But now that parent node has overflowed. So it also gets split, moving its middle key up to its 
parent node, as shown in Figure 12.18. 


Two other ordering of the seven keys in Example 12.5 on page 235 that will produce the same BST: 
a. 44, 22, 33, 77, 55, 99, 88 
b. 44, 22, 77, 33, 55, 99, 88 

An array of objects could be sorted by inserting their objects into a binary search tree and then using 


an inorder traversal to copy them back into the array. The BST property guarantees that the inorder tra- 
versal will visit the elements in order. 


If an AVL tree is used, then each insertion runs in O(lgz7) time, so building the tree with n 
elements will require O(7 lg) time. The subsequent inorder traversal also has O(7 1g) complexity, so 
the entire algorithm sorts the array in O(mlgn) time. 


All except a are binary search trees. 


242 SEARCH TREES [CHAP. 12 


Figure 12.17 Inserting the key 16 in a five-way search tree 


Figure 12.18 Inserting the key 16 in a five-way search tree 


12.5 private AVLTree rotateRight() { 
AVLTree x = this, y = left, z = y. left; 
xX. left = Z; 
y. left = x; 
int xb = x.balance; 
int yb = y.balance; 
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if (yb > 1) { 
++x.balance; 
y.balance = ( xb<0O ? yb+1 : xb+yb+2 ); 
else if (yb > xb) { 
x.balance += yb-1; 
++y.balance; 
} else { 
y.balance = xb+2; 
} 
return y; 


} 


w 


12.6 Theorem. Every subtree of a binary search tree is a binary search tree. 


Proof: Let T be a binary search tree, and let S be a subtree of 7: Let x be any element in S, and let Z and 
R be the left and right subtrees of x in S. Then, since S is a subtree of 7; x is also an element of 7; and L 
and R are the left and right subtrees of x in 7. Therefore, y <x <z for every y € L andeveryz € R 
because T has the BST property. Thus, S also has the BST property. 

12.7. Theorem. Every subtree of an AVL tree is an AVL tree. 


Proof: The proof that every subtree of a binary search tree is a binary search tree is given in Problem 
12.6. If a S is a subtree of an AVL tree J, then every node is S is also in 7. Therefore, the balance 
number at every node in S is —1, 0, or 1. 


12.8 = The solution is shown in Figure 12.19. 


Dee Ss Dp 
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Figure 12.19 AVL tree insertions 
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Figure 12.19 (continued) AVL tree insertions 
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Heaps and Priority Queues 


HEAPS 


A heap is a complete binary tree whose elements have keys that satisfy the following heap 
property: the keys along any path from root to leaf are descending (i.e., nonincreasing). 


EXAMPLE 13.1 A Heap 


Figure 13.1 shows a heap. Note that the keys along 
each of its root-to-leaf paths are descending: 
77 = 66 = 44 > 22; 
77 = 66 = 44> 41; 
77 = 66 = 60 = 58; 
77 = 66 = 60 = 25; 
77 = 55 = 33 = 29; 
772552 55. Figure 13.1 A heap 


Heaps could represent family descendant trees because the heap property means that every 
parent is older than its children. 

Heaps are used to implement priority queues (page 247) and to the heap sort algorithm 
(page 266). 


THE NATURAL MAPPING 


Every complete binary tree has a natural mapping into an array. (See Algorithm 11.1 on page 
206.) The mapping is obtained from a level-order traversal of the tree. In the resulting array, the 
parent of the element at index i is at index i/2, and the children are at indexes 2 and 2i+1. 


EXAMPLE 13.2 Storing a Heap in an Array 


The heap shown in Figure 13.1 maps into the array 01234 5 6 7 8 9 0 1 12 
shown Figure 13.2. 77|66|55/44|60|33]55) 22/41 |58|25|29 

For example, element 60 is at index i = 5, its parent is 
element 66 at index i/2 = 2, and its children are elements 
58 and 25 at indexes 27 = 10 and 27+ 1=11. 


Figure 13.2 Array storage of a heap 
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The natural mapping between a complete binary 
tree and an array is a two-way correspondence. To 
map the array elements back into a complete binary 
tree, simply number the tree nodes consecutively in 
a level-order traversal beginning with number | at 
the root. Then copy the array element at index i into 
the tree node numbered 7. The locations for those 
indexes are shown in Figure 13.3. If the resulting 
tree has the heap property, then we also say that the 
array has the heap property. 


Figure 13.3 Array indexes in a heap 


EXAMPLE 13.3 Determining Whether an Array Has the Heap Property 


To determine whether this array has the heap 
eed 5 O 24... 2s 3) 4 Oe 8 910 
property, we first map it into a binary tree, and then 381661771 33]44|55]75| 30122154 
check each root-to-leaf path. 

The root-to-leaf path {88, 66, 44, 51} shown 
in Figure 13.4 is not descending because 44 < 51. 
Hence, the tree does not have the heap property. 
Therefore, the array does not have the heap property. 


An array with the heap property is partially 
ordered. That means that most of the larger keys 
come before most of the smaller keys. More 
precisely, it means that every heap-path subar- 
ray is sorted in descending order, where a heap- 
path subarray is a subsequence of array 
elements in which each index number is half 
that of its successor. For example, {a[1], a[2], Figure 13.4 Checking the heap property 
a[5], a[11], a[22], a[45], a[90], a[180]} 
would be a heap-path subarray of an array a[] of 200 elements. The heap sort algorithm 
(Algorithm 14.8 on page 266) exploits this fact to obtain a fast and efficient method for sorting 
arrays. 


INSERTION INTO A HEAP 


Elements are inserted into a heap next to its right-most leaf at the bottom level. Then the heap 
property is restored by percolating the new element up the tree until it is no longer “older” (i.e., 
its key is greater) than its parent. On each iteration, the child is swapped with its parent. 


EXAMPLE 13.4 Inserting into a Heap 


Figure 13.5 on page 247 shows how the key 75 would be inserted into the heap shown in Figure 13.4. 
The element 75 is added to the tree as a new last leaf. Then it is swapped with its parent element 44 
because 75 > 44. Then it is swapped with its parent element 66 because 75 > 66. Now the heap property 
has been restored because the new element 75 is less than its parent and greater than its children. 


Note that the insertion affects only the nodes along a single root-to-leaf path. 
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Figure 13.5 Inserting 75 into a heap 


REMOVAL FROM A HEAP 


The heap removal algorithm always removes the root element from the tree. This is done by 
moving the last leaf element into the root element and then restoring the heap property by perco- 
lating the new root element down the tree until it is no longer “younger” (i.e., its key is less) than 
its children. On each iteration, the parent is swapped with the older of its two children. 


EXAMPLE 13.5 Removing from a Heap 


Figure 13.6 shows how the root element (key 88) would be removed from a heap. 

The last leaf (key 44) is removed and copied into the root, replacing the previous root (key 88), which is 
removed. Then, to restore the heap property, the element 44 is swapped with the larger of its two children 
(77). That step is repeated until the element 44 is no longer smaller than any of its children. In this case, 
the result is that 44 ends up as a leaf again. 


Note that the removal affects only the nodes along a single root-to-leaf path. That gives us this 
result from Corollary 11.2 on page 202: 


Theorem 13.1 Insertions into and removals from a heap run in O(ign) time. 
PRIORITY QUEUES 


A stack is a LIFO container: The last one in comes out first. A queue is a “FIFO container: 
The first one in comes out first. A priority queue is a “BIFO container”: The best one in comes 


248 HEAPS AND PRIORITY QUEUES [CHAP. 13 


Figure 13.6 Deleting 88 from a heap 


out first. That means that each element is assigned a priority number, and the element with the 
highest priority comes out first. 

Priority queues are widely used in computer systems. For example, if a printer is shared by 
several computers on a local area network, the print jobs that are queued to it would normally be 
held temporarily in a priority queue wherein smaller jobs are given higher priority over larger 
jobs. 

Priority queues are usually implemented as heaps since the heap data structure always keeps 
the element with the largest key at the root and its insertion and removal operations are so 
efficient. According to Theorem 13.1, those operations are guaranteed to run in in O(/g7) time. 


THE JCF PriorityQueue CLASS 


The Java Collections Framework includes a PriorityQueue class. As Figure 4.1 on page 70 
shows, that class extends the AbstractQueue and AbstractList classes, implementing the 
Queue and List interfaces. 


EXAMPLE 13.6 The java.util .PriorityQueue Class 


public class TestingPriorityQueues { 


1 

2 public static void main(String[] args) { 

3 PriorityQueue<String> pq = new PriorityQueue<String>() ; 
4 pq.addC("FR"); 

5 pq.add("DE"); 

6 pq.add("GB"); 
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} 


pq.addC"IT"); 

pq.add("ES"); 

while (!pq.isEmpty()) f 
System.out.printfC"%s 


} 


, pq.remove()); 


} 


The output is: 


DE ES FR GB_ IT 


The collection pq is a priority queue, so its elements are removed according to their priorities. The 


element type for this queue is String, which has its own natural ordering: alphabetical order. So regard- 
less of the order in which they are inserted, they are removed in alphabetical order. 


If the element type has no natural ordering, then PriorityQueue instances will apply the 


compareTo() method to determine priorities among the elements. 


EXAMPLE 13.7 Using Elements that Explicitly Implement the Comparab1e Interface 


oO ON DOD oO FF WN = 


public class TestingPriorityQueues { 


} 


public static void main(String[] args) { 
PriorityQueue<Student> pq = new PriorityQueue<Student>() ; 
pq.add(new StudentC("Ann",44)); 
pq.add(new Student("Bob",99)); 
pq.add(new Student("Cal",33)); 
pq.add(new Student("Don",66)); 
while (!pq.isEmpty()) f{ 
System.out.printf("%s ", pq.remove()); 
} 
} 


class Student implements Comparablef{ 


} 


private String name; 
private int credits; 


public Student(String name, int credits) { 
this.name = name; 
this.credits = credits; 


} 


public int compareTo(Object object) { 
if (object == this) { 
return 0; 
} else if (!Cobject instanceof Student)) { 
throw new I1legalArgumentException("comparing apples and oranges!"); 
} 
Student that = (Student)object; 
return this.credits - that.credits; 
} 
public String toString() { 
return String.format("%s(%d)", name, credits); 


} 


The output is: 


Cal(33) Ann(44) Don(66) Bob(99) 
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The priority queue pq defined at line 3 stores instances of the Student class that is defined at line 14. 
That class is declared to implement the Comparab1e interface, which obliges it to define a compareTo() 
method. That method, defined at line 23, uses the credits field of the Student objects to compare them. 
Students with more credits have higher priority. 

The print loop at line 8 is the same as the one in Example 13.6: It applies the priority queue’s remove () 
method to remove and print the elements according to their ascending priority levels, independently of 
their insertion order (except for equal priorities). 


Review Questions 


13.1. What are the two main applications of heaps? 

13.2. How efficient are insertions into and removals from a heap? 
13.3. Why isa priority queue called a BIFO container? 

13.4 What is the difference between a queue and a priority queue? 
13.5 Why are heaps used to implement priority queues? 


13.6 In the natural mapping of a binary tree into an array a[], why do we start at a[1] instead of 
at a[0]? 


13.7 _‘If it takes an average of 3ms to remove an element from a priority queue with 1,000 ele- 
ments, how long would you expect it to take to remove an element from a priority queue with 
1,000,000 elements? 


13.8 Suppose a method is devised to sort an array by storing its element in a priority queue and 
then removing them back into the array. What is the run time for such an algorithm? 


Problems 


13.1. Determine which of the binary trees in Figure 13.7 is a heap. 


Figure 13.7 Binary trees 


13.2 Determine which of the arrays in Figure 13.8 on page 251 has the heap property. 
13.3. Show the heap after inserting each of these keys in this order: 44, 66, 33, 88, 77, 77, 22. 
13.4 Show the array obtained from the natural map of each of the heaps obtained in Problem 13.3. 
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13.5 


13.6 
13.7 


13.1 
13.2 
13.3 


13.4 


13.5 


13.6 


13.7 


13.8 


13.1 
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Figure 13.8 Arrays 


Write and test this method 
boolean isHeapCint[] a) 
// returns true if and only if the specified array 
// has the heap property 


Prove that every subtree of a heap is a heap. 


Show the heap after inserting each of these keys in this order: 50, 95, 70, 30, 90, 25, 35, 80, 60, 
40, 20, 10, 75, 45, 35. 


Answers to Review Questions 


Heaps are used to implement priority queues and the heap sort. (See page 266.) 
Insertions into and removals from a heap are very efficient; they run in O(1g7). 


A priority queue is a “best-in-first-out” container, that is, the element with the highest priority comes 
out first. 


Elements are removed from a queue in the same order in which they are inserted: first-in-first-out. Ele- 
ments in a priority queue must have an ordinal key field which determines the priority order in which 
they are to be removed. 


Heaps are used to implement priority queues because they allow O(lgv) insertions and removals. This 
is because both the add() and the remove() methods are implemented by traversing a root-to-leaf 
path through the heap. Such paths are no longer than the height of the tree which is at most lgn. 


The natural mapping starts at a[1] instead of a[0] to facilitate navigation up and down the heap tree. 
By numbering the root | and continuing sequentially with a level order traversal, the number of the 
parent of any node numbered k will be 4/2, and the numbers of its child nodes will be 24 and 2k+1. 


If it takes an average of 3ms to remove an element from a priority queue with 1,000 elements, then it 
should take about 6ms to remove an element from a priority queue with 1,000,000 elements. 


The run time for a method that uses a priority queue to sort an array would be O(2n1gn) because it 
will make 7 insertions and n removals, each running in O(lgz) time. 


Solutions to Problems 


a. This is not a heap because the root-to-leaf path {88, 44, 77} is not descending (44 < 77). 

b. This is a heap. 

c. This is not a heap because the root-to-leaf path {55, 33, 44} is not descending (33 < 44) and the 
root-to-leaf path {55, 77, 88} is not descending (55 < 77 < 88). 

d. This is not a heap because the binary tree is not complete. 

e. This is a heap. 

f. This is not a heap because the tree is not binary. 
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13.2 a. This array does not have the heap property because the root-to-leaf path {a[1], a[3], a[6]} = 
{88, 44, 77} is not descending (44 < 77). 


This array does have the heap property. 
This array does have the heap property. 


Bes 


This array does not have the heap property because its data elements are not contiguous: It does 
not represent a complete binary tree. 


e. This array does have the heap property. 


This array does not have the heap property because the root-to-leaf path {a[1], a[3], a[6]} = 
{88, 22, 55} is not descending (22 < 55) and the root-to-leaf path {a[1], a[3], a[7]} = 
{88, 22, 66} is not descending (22 < 66). 


13.3. Figure 13.9 shows a trace of the insertion of the keys 44, 66, 33, 88, 77, 55, 22 into a heap. 
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Figure 13.9 Trace of insertions into a heap 


13.4 Figure 13.10 on page 253 shows the arrays for the heaps in Problem 13.3. 
13.5 boolean isHeap(int[] a) { 
// returns true if and only if the specified array 
// has the heap property 
int n = a. length; 
for Cint i = n/2; 1 <n; i++) { 
for (int j = 71; j > 1; j /=2) { 
if (a[j/2] < a[j]l) f 
return false; 
} 
} 
} 


return true; 
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Figure 13.10 Trace of heap insertions into an array 


13.6 Theorem. Every subtree of a heap is also a heap. 


Proof: Let T be a heap, and let S be a subtree 
of T. (See Figure 13.11.) By definition, T is a 
complete binary tree with the heap property. 
Thus, by the theorem in the solution, S' is 
also a complete binary tree. Let x be the root 
of S, and let p be any root-to-leaf path in S. 
Then x is an element of 7’ since S is a subtree 
of 7; and there is a unique path g in T from x 
to the root of 7: Also, p is a path in T that 
connects x to a leaf of T since S is a subtree 
of T. Let g™! represent the reverse of the path 
q, and let g"'p represent the concatenation of 
q' with p in T. Then gp is a root-to-leaf path in 7. Hence the elements along g~'p must be descending 
because T has the heap property. Therefore the elements along p are descending. Thus S also has the 
heap property. 


13.7. _—‘“ Figure 13.12 shows a trace of the insertion of the keys 50, 95, 70, 30, 90, 25, 35, 80, 60, 40, 20, 10, 
75, 45, 35 into a heap. 


Figure 13.11 Subtree of a heap 


Figure 13.12 Insertion into a heap 
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Figure 13.12 (continued) Insertion into a heap 


CHAPTER 14 


Sorting 


We saw in Chapter 2 that searching through an array is far more efficient if its elements are 
sorted first. That fact is obvious to anyone who has ever looked up a number in a phone book or 
a word in a dictionary. This chapter outlines nine of the most common algorithms for sorting a 
linear data structure such as an array or a list. 


CODE PRELIMINARIES 


All of the sorting algorithms are implemented in this chapter using the same Java signature 
public static void sort(Cint[] a) 
for sorting an array of integers. This can easily be modified to sort an array of any other primitive 
type or an array of objects that implement the Comparab1e interface. 
The pseudocode and the Java code include preconditions, postconditions, and loop invariants. 
These are used to prove that the algorithms are correct and to analyze their complexity. 


In the pseudocode we use the notation s = {5 , s),..., 5,1} for a sequence of elements. The 
notation {S, . . . s,,} denotes the subsequence {s,, 8,.,,..-., 5,1} of elements from s, to s,,. In 
Java code comments, we represent the subsequence by s[p..q). For example, {s,...s,} and 


s[3..8) would both denote the subsequence {5;, 54, 55, 56, 57}. 

Unless otherwise noted, “sorted” will always mean that the elements of the sequence are in 
ascending order: sy) <s,;<s,<---<s,.. 

All the exchange sorts use this swap() method: 


EXAMPLE 14.1 A swap() Method for Arrays 


1 private static void swapCint[] a, int i, int j) { 
2 // PRECONDITIONS: 0 <= i < a. length; 0 <= j < a.length; 
3 // POSTCONDITION: a[i] and a[j] are interchanged; 
4 ifdg-=-j)ft 

5 return; 

6 } 

7 int temp=a[j]; 

8 alj] = ali]; 

9 ali] = temp; 

10 } 

This method simply swaps the ith and jth elements of the array. 


255 


256 SORTING [CHAP. 14 


Note the use of preconditions and postconditions, included as comments at lines 2 and 3 of 
Example 14.1. A precondition is a condition that is assumed to be true before the method is 
invoked. A postcondition is a condition that is guaranteed to be true after the method has been 
invoked, provided that the preconditions were true. Preconditions and postconditions define the 
contract for a method: the “consumer guarantee” that defines the method. They can be used to 
prove, logically, that the method will always “work” as expected. (For example, see Theorem 
14.10 on page 262.) 

All the array examples use this print() method: 


EXAMPLE 14.2 A print() Method for Arrays 


private static void printCint[] a) { 
for Cint ai: a) f{ 
System.out.printfC"%s 


} 
System.out.printiInQd; 
} 
Note the use of Java’s enhanced for loop construct at line 2. The variable ai represents array elements 
a[i] as the index i traverses its entire range from 0 to a. length -— 1. 


, ai); 


oa fF WN = 


THE JAVA Arrays.sortQ) METHOD 


The standard Java class library defines a sort() method in the java.util.Arrays class. It 
actually includes twenty overloaded versions of the method: four for arrays of objects, two for 
arrays of each primitive type except boolean, and two for generic types. (See page 95.) 

The signatures for the two sort() methods for arrays of ints are 

public static void sort(Cint[] a) 
public static void sortCint[] a, int p, int q) 
The first of these sorts the entire array. The second sorts the subarray a[p. .q). 


EXAMPLE 14.3 Using the Arrays.sort() Method 


public static void main(String[] args) { 


1 

2 int{] a= { 77, 44, 99, 66, 33, 55, 88, 22 }; 
3 print(a); 

4 java.util.Arrays.sort(a); 

5 print(a); 

6 


The output is: 
77 44 99 66 33 55 88 22 
22 33 44 55 66 77 88 99 


For arrays of elements of a primitive type, the Arrays.sort() method implements the quick 
sort. (See Algorithm 14.6 on page 263.). For arrays of elements of a reference type, it imple- 
ments the merge sort. (See Algorithm 14.5 on page 261.) 


THE BUBBLE SORT 


The bubble sort makes n—1 passes through a sequence of m elements. Each pass moves 
through the array from left to right, comparing adjacent elements and swapping each pair that is 
out of order. This gradually moves the larger elements to the right. It is called the bubble sort 
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because if the elements are visualized in a vertical column, then each pass appears to “bubble up” 
the next largest element by bouncing it off smaller elements, much like the rising bubbles in a 
carbonated beverage. 


Algorithm 14.1 The Bubble Sort 
(Precondition: s = {sp ...s,_;} is a sequence of v ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 
1. Do steps 2-4 for i=n-—1 down to 1. 
2. Do step 3 for 7 = 0 up to 7-1. 
3. If the two consecutive elements s; and s,,,, are out of order, swap them. 
4. (Invariants: The subsequence {s,...s, ,} is sorted, and s,= max{s,...5,}.) 


EXAMPLE 14.4 The Bubble Sort 


public static void sortCint[] a) { 


1 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 for (int i = a.length-1; i > 0; i--) { // step 1 

4 for Cint j = 0; j < i; j++) { // step 2 

5 if Calj] > alj+1]) f{ 

6 swap(a, j, j+l); // step 3 

7 } 

8 } 

9 // INVARIANTS: a[i] <= a[i+1] <= ... <= a[a.length-1]; 
10 // afj] <= afi] for all j <i; 

11 } 


Theorem 14.1 The Bubble Sort is correct. 
See the solution to Problem 14.14 on page 276 for a proof of this theorem. 


Theorem 14.2 The Bubble Sort runs in O(n’) time. 
See the solution to Problem 14.15 on page 276 for a proof of this theorem. 


THE SELECTION SORT 


The selection sort is similar to the bubble sort. It makes the n—1 passes through a sequence of 
n elements, each time moving the largest of the remaining unsorted elements into its correct 
position. But it is more efficient than the bubble sort because it doesn’t move any elements in the 
process of finding the largest. It makes only one swap on each pass after it has found the largest. 
It is called the selection sort because on each pass it selects the largest of the remaining unsorted 
elements and puts it in its correct position. 


Algorithm 14.2 The Selection Sort 
(Precondition: s = {sp ...S,_;} is a sequence of 7 ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 
1. Do steps 2—4 for i=n-—1 down to 1. 
2. Locate the index m of the largest element among {s)..5,}. 
3. Swap s,ands,,. 
4. (Invariants: the subsequence {s;...s,,} is sorted, and s;= max{s,...s,}.) 
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EXAMPLE 14.5 The Selection Sort 


public static void sort(Cint[] a) { 


1 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 for (int i = a.length-1; i > 0; i--) { // step 1 
4 int m = 0; // step 2 
5 for Cint j = 1; j <= i; j++) f{ 

6 if (a[j] > alm]) { 

7 m= J; 

8 } 

9 } 

10 // INVARIANT: a[m] >= a[j] for all j <= 7; 

"1 swap(a, i, m); // step 3 
12 // INVARIANTS: a[j] <= ali] for all j <= 7; 

13 // afi] <= afi+l] <= ... <= a[a.length-1]; 
14 } 

15 } 


Theorem 14.3 The selection sort is correct. 
See the solution to Problem 14.19 on page 276 for a proof of this theorem. 


Theorem 14.4 The selection sort runs in O(n’) time. 
See the solution to Problem 14.20 on page 276 for a proof of this theorem. 


Note that even though the bubble sort and the selection sort have the same complexity 
function, the latter runs quite a bit faster. That fact is suggested by the two traces: The bubble sort 
made 18 swaps while the selection sort made only 7. The selection sort has the advantage of 
swapping elements that are far apart, so it makes one swap where the bubble sort could require 
several. (See Exercise 11.8.) 


THE INSERTION SORT 


Like the two previous algorithms, the insertion sort makes n —1 passes through a sequence of 
n elements. On each pass it inserts the next element into the subarray on its left, thereby leaving 
that subarray sorted. When the last element is “inserted” this way, the entire array is sorted. 


Algorithm 14.3 The Insertion Sort 
(Precondition: s = {s,...s,_,} is a sequence of 7 ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 
1. Do steps 2-4 fori=1upton—-l. 
Hold the element s; in a temporary space. 
Locate the least index j for which s,>=s,. 
Shift the subsequence {s,...5,,} up one position to {5,,...5;}. 
Copy the held value of s; into s,. 
6. (Invariant: the subsequence {s,..5,} is sorted.) 


ns. er NS 


EXAMPLE 14.6 The Insertion Sort 


1 public static void sort(Cint[] a) { 
2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
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3 for (int i = 1; i < a.length; i++) { // step 1 
4 int ai = ali], j; // step 2 
5 for (j = 1; j > O && a[j-1] > ai; j--) { // step 3 
6 a{j] = a[j-1]; // step 4 
7 

8 alj] = ai; // step 5 
9 // INVARIANT: a[O] <= a[1] <=... <= a[i]; 

10 } 

1 } 


Theorem 14.5 The insertion sort is correct. 
See the solution to Problem 14.23 on page 277 for a proof of this theorem. 


Theorem 14.6 The insertion sort runs in O(n’) time. 
See the solution to Problem 14.24 on page 277 for a proof of this theorem. 


Theorem 14.7 The insertion sort runs in O(”) time on a sorted sequence. 
See the solution to Problem 14.25 on page 283 for a proof of this theorem. 


THE SHELL SORT 


Theorem 14.7 suggests that if the sequence is nearly sorted, then the insertion sort will run 
nearly in O(n) time. That is true. The she// sort exploits that fact to obtain an algorithm that in 
general runs in better than O(n!) time. It applies the insertion sort repeatedly to skip subse- 
quences such as {5p, 53, S65 Soy - + + 5 S, 2 and {8), 54, 87, Sio,.- + 5 S,1}- These are two of the three 
skip-3-subsequences. 


Algorithm 14.4 The Shell Sort 
(Precondition: s = {s,...s,_;} is a sequence of 7 ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 


1. Setd=1. 

2. Repeat step 3 until 9d>n. 

3. Setd=3d+1. 

4. Do steps 5—6 until d= 0. 

5. Apply the insertion sort to each of the d skip-d-subsequences of s. 
6. Set d=d/3. 


Suppose that s has n = 200 elements. Then the loop at step 2 would iterate three times, increas- 
ing d from 1 to d= 4, 13, and 40. 

The first iteration of the loop at step 4 would apply the insertion sort to each of the 40 skip-40- 
subsequences {3, 8495 S05 51209 Si60fs {Sis Sais Sgis Sits Storts (S25 S425 S25 S1225 Sterhs + + + 5 18309 S799 St195 
S159) Si99$. Then step 6 would reduce d to 13, and then the second iteration of the loop at step 4 
would apply the insertion sort to each of the thirteen skip-13-subsequences {59, 813, S265 5395 S525 S659 


2+ Stoats {815 Stas S27» S40 S530 S662 ++ + 9 Stosts + ++ > (S129 S250 S3gs Ssis Seas S77 + + + » Sig3f- Then step 6 
would reduce d to 4, and the third iteration of the loop at step 4 would apply the insertion sort to 
each of the four skip-4-subsequences {S05 S45 Ss S125 se eg Sio6hs {81 S55 S95 S135 seg Stor}, {8, S65 S105 


Sido s+ +5 Sjogh, and {55, 87, S11, Sys, -- +» Sjo9}. Then step 6 would reduce d to 1, and, and the fourth 
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iteration of the loop at step 4 would apply the insertion sort to the entire sequence. This entire 
process would apply the insertion sort 58 times: 40 times to subsequences of size n, = 5, 13 times 
to subsequences of size n, = 15, 4 times to subsequences of size n, = 50, and once to the entire 
sequence of size n, =n = 200. 

At first glance, the repeated use of the insertion sort within the shell sort would seem to take 
longer than simply applying the insertion sort directly just once to the entire sequence. Indeed, a 
direct calculation of the total number of comparisons, using the complexity function n’, yields 

40(n,7) + 13(n,”) + 4(n,’) + 1(1,7) = 40(57) + 13(15*) + 4(507) + 1(2007) = 53,925 
which is quite a bit worse than the single 
n* = 2007 = 40,000 
But after the first iteration of step 4, the subsequent subsequences are nearly sorted. So the actual 
number of comparisons needed there is closer to n. Thus, the actual number of comparisons is 
more like 
40(n,7) + 13(n,) + 4(15) + 1(14) = 40(5) + 13(15) + 4(50) + 1(200) = 1,595 
which is quite a bit better than 40,000. 


Theorem 14.8 The shell sort runs in O(n'>) time. 
Note that, for 7 = 200, n!5 = 200!5 = 2,829, which is a lot better than n* = 2007 = 40,000. 
EXAMPLE 14.7 The Shell Sort 


public static void sort(Cint[] a) { 


1 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 int d = 1, j, n = a. length; // step 1 
4 while (9*d < n) { // step 2 
5 d = 3*d + 1; // step 3 
6 } 

7 while (d > 0) { // step 4 
8 for Cint i = d; i < n; i++) { // step 5 
9 int ai = ali]; 

10 ye 15 

1 while (j >= d && a[j-d] > ai) { 

12 alj] = alj-d]; 

13 j -=d; 

14 } 

15 a[j] = ai; 

16 } 

17 d /= 3; // step 6 
18 } 

19 } 


THE MERGE SORT 


The merge sort applies the divide-and-conquer strategy to sort a sequence. First it subdivides 
the sequence into subsequences of singletons. Then it successively merges the subsequences 
pairwise until a single sequence is re-formed. Each merge preserves order, so each merged subse- 
quence is sorted. When the final merge is finished, the complete sequence is sorted. 

Although it can be implemented iteratively, the merge sort is naturally recursive: Split the 
sequence in two, sort each half, and then merge them back together preserving their order. The 
basis occurs when the subsequence contains only a single element. 
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Algorithm 14.5 The Merge Sort 
(Precondition: s = {s, ...S,,} 18 a sequence of g — p ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 

1. Ifqg—p> 1, do steps 2-5. 


2. Split s into two subsequences, a = {s,...5,,,} and b = {s,,...5,,}, where 
m= (q — p)/2. 

3. Sort a. 

4. Sort b. 


5. Merge a and b back into s, preserving order. 


EXAMPLE 14.8 The Merge Sort 


1 public static void sortCint[] a) { 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 sort(a, 0, a.length); 

4 } 

5 

6 private static void sort(int[] a, int p, int m, int q) f{ 
7 // PRECONDITIONS: 0 <= p <= q <= a. length; 

8 // POSTCONDITION: a[p..q) is sorted; 

9 if (qq -p<2)f{ // step 1 

10 return; 

1 } 

12 int m= (p + q)/2; // step 2 

13 sort(a, p, m); // step 3 

14 sort(a, m, q); // step 4 

15 merge(a, p, m, q); // step 5 

16 } 

17 

18 private static void mergeC(int[] a, int p, int m, int q) { 
19 // PRECONDITIONS: 0 <= p <= ™m < q <= a. length; 
20 // a[p..m) is sorted; 

21 // a[m..q) is sorted; 

22 // POSTCONDITION: a[p..q) is sorted; 

23 if Ca[m-1] <= a[m]) { 

24 return; 

25 } 

26 int i=p, j =m, k = 0; 

27 int[] tmp = new int[q-p]; 

28 while Gi < m & j <q) f{ 

29 // INVARIANT: tmp[0..k) is sorted 

30 tmp[k++] = ( ali]<=a[j] ? alit+] : a[j++] ); 
31 

32 System.arraycopy(a, i, a, ptk, m-i); 

33 System.arraycopy(tmp, 0, a, p, k); 

34 } 


The main sort() method sorts the entire array by invoking the overloaded sort() method with 
parameters for the starting index k and the length n of the subarray. That three-parameter method sorts the 
subarray by sorting its left half and its right half separately and then merging them. 

The merge() method merges the two halves a[p..m) and a[m. .q) into a temporary array, where m is 
the middle index m = p + n/2. The while loop copies one element on each iteration; it copies the smaller 
of the two elements a[i] and a[j]. The post increment operator automatically advances the index of the 
copied element. When all the elements of one half have been copied, the while loop stops and then all the 
elements are copied back into a[]. 
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Theorem 14.9 The merge sort runs in O(1 lgn) time. 

In general, the merge sort works by [77] 44| 99] 66|33|55|88|22| 
repeatedly dividing the array in half until the 
pieces are singletons, and then it merges the Pa \ 
pieces pairwise until a single piece remains. |77|44[ 99] 66| 
This is illustrated by the diagram in Figure \ 


14.1. The number of iterations in the first 


part equals the number of times n can be 


halved: that is, Ign — 1. In terms of the / \ / \ i \ / \ 
number and sizes of the pieces, the second 
part of the process reverses the first. So the \ / \ / \ / \ / 
second part also has lgn — 1 steps. So the 


compares all m elements. So the total 


entire algorithm has O(1gn) steps. Each step 


number of comparisons is O(n lg). [44[66|77|99) 
Theorem 14.10 The merge sort is correct. \ y 
The proof follows from the preconditions 22]33] 44/55] 66] 77/88/99) 


and postconditions given in the code. In the 
main sort() method, the array is already 
sorted if its length is 0 or 1. Otherwise, the postcondition of the three-parameter sort() method 
guarantees that the array will be sorted after that method returns because the entire array is 
passed to it. That postcondition is the same as the postcondition of the merge() method, which is 
invoked last, so it remains to verify that the merge() method’s postcondition will be true. 

The merge() method’s postcondition follows from its loop invariant, because when that loop 
has finished, the tmp[] array is sorted and that is copied back into a[] in the same order. So it 
remains to verify the loop invariant for allk < q - p. 

Suppose the invariant is false for some 
k, that is, tmp[0..k) is not sorted. Then 
there must be some x and y in tmp[0. .k), 
where x was copied into tmp[] before y 
but x > y. We may assume without loss of 
generality that x was copied from the left 
half of a[] and y from the right half, as 
shown in Figure 14.2. Thus, x = a[r] and 
y = a[s] for some indexes r and s such 
that p< r<iandm<s <j. Now the two halves of a[] are each already separately sorted. Then 
for every element z in a[m..s], z < a[s]. But a[s] = y < x. Therefore, every element z in 
a[m..s] must have been copied into tmp[] before x was, because this assignment 

tmp[k++] = ( aLi]<=alj] ? ali++] : alj++] ); 
always copies the smaller element first. But that means that a[s] was copied into tmp[] before 
x. But a[s] = y, which was assumed to have been copied into tmp[] after x. This contradiction 
proves that the invariant must be true. 


Figure 14.1 The merge sort 


k lo hi 


Figure 14.2 The merge sort 
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By using the divide-and-conquer strategy, the merge sort obtains an O(nlgn) run time, a 
significant improvement over the O(n’) times spent by the previous sorting algorithms. The strat- 
egy is 

1. Split the sequence into two subsequences. 

2. Sort each subsequence separately. 

3. Merge the two subsequences back together. 
The merge sort does the first step in the simplest balanced way possible: It splits the sequence at 
its middle. If the first step is done in other ways, we obtain different sorting algorithms. The 
divide-and-conquer strategy is also used in the binary search (page 31). 

The simplest unbalanced way to split the sequence is to put all but the last element in the first 
subsequence, leaving only the last element in the second subsequence. This produces the 
recursive version of the insertion sort. (See Problem 14.22 on page 277.) 

Another unbalanced way to split the sequence is to put the largest element alone in the second 
subsequence, leaving all the other elements in the first subsequence. This produces the recur- 
sive version of the selection sort. (See Problem 14.18 on page 276.) Not that this makes the 
merge step 3 trivial: Merely append the largest element to the end of the first subsequence. 

A fourth way to split the sequence is to partition it so that every element in the first subse- 
quence is less than every the element in the second subsequence. This condition of course is true 
in the previous case that led to the recursive selection sort. However, if we can obtain this 
property together with having the two subsequences the same size, then we obtain a new 
O(n|gn) algorithm, called the quick sort. 


THE QUICK SORT 


The quick sort is like the merge sort: It is recursive, it requires an auxiliary function with several 
loops, and it runs in O(1gn) time. But in most cases it is quicker than the merge sort. 

The quick sort works by partitioning the array into two pieces separated by a single element x 
that is greater than or equal to every element in the left piece and less than or equal to every 
element in the right piece. This guarantees that the single element x, called the pivot element, is 
in its correct position. Then the algorithm proceeds, applying the same method to the two pieces 
separately. This is naturally recursive and very quick. 


Algorithm 14.6 The Quick Sort 
(Precondition: s = {s,...S,,} is a sequence of gq — p ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 

1. If g—p> 1, do steps 2-5. 
Apply Algorithm 14.7 to s, obtaining the pivot index m. 
(Invariant: the pivot element s,, is in its correct sorted position.) 
Apply the quick sort to {5 , 5),...5 5}. 
Apply the quick sort to {5,..), Sia, . ++ 5 Spats 
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Algorithm 14.7 Partition 

(Precondition: s = {s,...S,,} is a sequence of gq — p ordinal values.) 

(Postcondition: Return m, where p < m<qands,; < s,, < s;forp < iS m<j<q.) 
1. Set x=, (the pivot element). 
2. Seti=pandj=4q. 
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Repeat steps 4—7 while i </. 
Decrement j until either s;< x or j =i. 
If j > i, copy s; into s,. 

Increment i until either s;>x ori =/. 
If j > i, copy s; into s;. 

8. Copy x into s,. 


Sy ae 


EXAMPLE 14.9 The Quick Sort 


1 public static void sort(Cint[] a) { 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 sort(a, 0, a. length); 

4 } 

5 

6 private static void sortCint[] a, int p, int q) f{ 

7 // PRECONDITION: 0 <= p <= q <= a. length 

8 // POSTCONDITION: a[p..q) is sorted; 

9 if (q-p< 2) f{ 

10 return; 

"1 } 

12 int m = partition(a, p, q); // step 2 

13 sort(a, p, m); // step 4 

14 sort(a, m+1l, q); // step 5 

15 } 

16 

17 private static int partitionCint[] a, int p, int q) { 
18 // RETURNS: index m of pivot element a[m]; 

19 // POSTCONDITION: a[i] <= a[m] <= a[j] for p <= 1 <= m <= j < q; 
20 int pivot = a[p], i =p, j =q;3 // steps 1-2 
21 while Gi < j) f // step 3 

22 while (i < j && a[--j] >= pivot) ; // step 4 

23 ifaqa<jf{ 

24 afi] = a[j]; // step 5 

25 } 

26 while Ci < j && a[++i] <= pivot) ; // step 6 

27 ifaqa<jf{ 

28 alj] = ali]; // step 7 

29 } 

30 } 

31 a[j] = pivot; // step 8 

32 return j; 

33 } 


Note the empty loop at line 22 and line 26. All the action is managed within the loop condition, so no 
statements are in its body. 


Algorithm 14.7 selects the pivot element to be the last element in the sequence. The algorithm 
works just as well if it is selected to be the first element or the middle element. Slightly better 
performance is obtained by selecting the median of those three elements. 

The Java Arrays.sort() method implements the quick sort, selecting the pivot as the 
median of the three elements {5, S,., 5,,} when n < 40, and the median of 9 equally spaced 
elements when n > 40. It also switches to the insertion sort (Algorithm 14.3 on page 258) when 
n<7, 
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Theorem 14.11 The quick sort runs in O(71gn) time in the best case. 

The best case is when the sequence values are uniformly randomly distributed so that each call 
to the quick partition algorithm will result in balanced split of the sequence. In that case, each 
recursive call to the quick sort algorithm divides the sequence into two subsequences of nearly 
equal length. As with the binary search and the merge sort (Algorithm 14.5 on page 261), this 
repeated subdivision takes lgn steps to get down to size 1 subsequences, as illustrated in the 
diagram in Figure 14.2 on page 262. So there are O(|lgn) calls made to the quick partition 
algorithm which runs in O(n”) time, so the total running time for the quick sort algorithm is 
O(nl|gn). 


Theorem 14.12 The quick sort runs in O(n’) time in the worst case. 

The worst case is when the sequence is already sorted (or sorted in reverse order). In that case, 
the quick partition algorithm will always select the last element (or the first element, if the 
sequence is sorted in reverse order), resulting in the most unbalanced split possible: One piece 
has n—2 elements, and the other piece has 1 element. Repeated division of this type will occur 
O(n) times before both pieces get down to size 1. So there are O(n) calls made to the quick parti- 
tion algorithm which runs in O(7) time, so the total running time for the quick sort algorithm is 
O(n’). 

Note that in the worst case, the quick sort reverts to the selection sort (Algorithm 14.2 on page 
257) because each call to quick partition amounts to selecting the largest element from the subse- 
quence passed to it. So actually, Theorem 14.12 is a corollary to Theorem 14.4 on page 258. 


Theorem 14.13 The quick sort runs in O(n lgn) time in the average case. 
The proof of this fact is beyond the scope of this outline. 


Theorem 14.14 The quick sort is correct. 

The invariant inside the while loop proof claims that all the elements to the left of a[i] are 
less than or equal to the pivot element and that all the elements to the right of a[j] are greater 
than or equal to the pivot. This is true because every element to the left of a[i] that is greater 
than the pivot was swapped with some element to the right of a[j] that is less than the pivot, 
and conversely (every element to the right of a[j] that is less than the pivot was swapped with 
some element to the left of ali] that is greater than the pivot. When that loop terminates, j < 
i, so at that point all the elements that are greater than the pivot have been moved to the right of 
a[i], and all the elements that are less than the pivot have been moved to the left of afi]. This 
is the invariant in step 7 of the quick partition algorithm. So after the swap in step 8, all the 
elements that are greater than the a[i] are to the right of a[i], and all the elements that are less 
than the a[i] are to the left of a[i]. This is the invariant in step 7 of the quick partition 
algorithm, which is the same as the invariant in step 3 of the quick sort algorithm. So then sorting 
the left segment and the right segment independently will render the entire sequence sorted. 


THE HEAP SORT 


A heap is by definition partially sorted, because each linear string from root to leaf is sorted. 
(See Chapter 13.) This leads to an efficient general sorting algorithm called the heap sort. As 
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with all sorting algorithms, it applies to an array (or vector). But the underlying heap structure 
(a binary tree) that the array represents is used to define this algorithm. 

Like the merge sort and the quick sort, the heap sort uses an auxiliary function that is called 
from the sort() function. And also like the merge sort and the quick sort, the heap sort has 
complexity function O(nlgn). But unlike the merge sort and the quick sort, the heap sort is not 
recursive. 

The heap sort essentially loads n elements into a heap and then unloads them. By Theorem 
13.1 on page 247, each element takes O(1g7) time to load and O(|gv) time to unload, so the entire 
process on 7 element runs in O(71gn) time. 


Algorithm 14.8 The Heap Sort 
(Precondition: s = {s,...5,_,} is a sequence of 7 ordinal values.) 
(Postcondition: The entire sequence s is sorted.) 
1. Do steps 2—3 for i = n/2 —1 down to 0. 
. Apply the heapify algorithm to the subsequence {s,...5s,;}. 
. (Invariant: every root-to-leaf path in s is nonincreasing.) 
. Do steps 5—7 for i= n-—1 down to 1. 
. Swap s; with so. 
. Invariant: The subsequence {s;.. .s,_;} is sorted.) 
. Apply the heapify algorithm to the subsequence {s)...s,,}. 
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Algorithm 14.9 The Heapify 
(Preconditions: ss = {s;... S;,,} 18 a subsequence of j—i ordinal values, and both subsequences 
{Si S,,} and {5,9 ...5;,,} have the heap property.) 
(Postcondition: ss itself has the heap property.) 
1. Let t= 85,4. 


2. Let s, = max {5y;:,, Soi2$, SO A = 2i+1 or 2i+2, the index of the larger child. 
3. If t< s,, do steps 4-6. 

4. Set s;= 5,. 

5. Seti=k. 

6. If i<n/2 and s;< max {55,;, Sou.}, repeat steps 1-4. 

7. Set s, = t. 


There are two aspects to these algorithms that distinguish them from the methods outlined in 
Chapter 12. The heaps here are in the reverse order, so each root-to-leaf path is descending. And 
these algorithms use 0-based indexing. The reverse order guarantees that heapify will always 
leave the largest element at the root of the subsequence. Using 0-base indexing instead of 1- 
based indexing renders the sort() method consistent with all the other sort() methods at the 
expense of making the code a little more complicated. 


EXAMPLE 14.10 The Heap Sort 


34 public static void sort(Cint[] a) { 

35 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
36 int n = a.length; 

37 for Cint i = n/2 - 1; i >= 0; i--) { // step 1 

38 heapify(a, i, n); // step 2 


39 } 
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40 for (int i =n - 1; i> 0; i--) f{ // step 4 
a swap(a, 0, i); // step 5 
42 heapify(a, 0, 1); // step 7 
43 } 

44 } 

45 

46 private static void heapifyCint[] a, int i, int j) { 
47 int ai = a[i]; // step 1 
48 while (2*i+1 < j) { 

49 int k = 2*7 +1; 

50 if ck + 1 < j && a[k+1] > a[k]) { 

51 ++k; // a[k] is the larger child 

52 } 

53 if (ai >= a[k]) { 

54 break; // step 3 
55 } 

56 ali] = alk]; // step 4 
57 i =k; // step 5 
58 } 

59 ali] = ai; // step 7 
60 } 


The sort() function first converts the array so that its underlying complete binary tree is transformed 
into a heap. This is done by applying the heapifyQ function to each nontrivial subtree. The nontrivial 
subtrees (i.e., those having more than one element) are the subtrees that are rooted above the leaf level. In 
the array, the leaves are stored at positions a[n/2] through a[n]. So the first for loop in the sortQ 
function applies the heapify( function to elements a[n/2-1] back through a[0] (which is the root of 
the underlying tree). The result is an array whose corresponding tree has the heap property, illustrated in 
Figure 14.3. 


0 1 2 3 4 5 6 r 8 
polseles[aelsa]ss[7z2[6] <G > 


Figure 14.3 The natural mapping for the heap sort 


Now the main (second) for loop progresses through n-1 iterations. Each iteration does two things: it 
swaps the root element with element a[i], and then it applies the heapify( function to the subtree of 
elements a[0..i). That subtree consists of the part of the array that is still unsorted. Before the swap () 
executes on each iteration, the subarray a[0..i] has the heap property, so a[i] is the largest element in 
that subarray. That means that the swap() puts element a[i] in its correct position. 

The first seven iterations of the main for loop have the effect shown by the seven pictures in Figure 
14.4 on page 268. The array (and its corresponding imaginary binary tree) is partitioned into two parts: 
The first part is the subarray a[0..i) that has the heap property, and the second part is the remaining 
aLli..n) whose elements are in their correct positions. The second part is shaded in each of the seven 
pictures in Figure 14.4 on page 268. Each iteration of the main for loop decrements the size of the first 
part and increments the size of the second part. So when the loop has finished, the first part is empty and 
the second (sorted) part constitutes the entire array. This analysis verifies the following theorem. 
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Figure 14.4 Tracing the heap sort 
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Theorem 14.15 The heap sort is correct. 
See Problem 14.31 on page 277. 


Theorem 14.16 The heap sort runs in O(1 lgn) time. 


Each call to the heapify() function takes at most lgm steps because it iterates only along a 
path from the current element down to a leaf. The longest such path for a complete binary tree of 
n elements is lgn. The heapify() function is called n/2 times in the first for loop and n—-1 
times in the second for loop. That comes to less than (37/2) 1g n, which is proportional to n lg n. 


If we regard a sorting algorithm as a stream process wherein elements stream into an array in 
random order and then stream out in sorted order, then the heap sort can be regarded as an 
efficient mean between the extremes of the selection sort and the insertion sort. The selection 
sort does all its sorting during the removal stage of the process, having stored the elements in the 
unsorted order in which they arrived. The insertion sort does all its sorting during the insertion 
stage of the process so that the elements can stream out of the array in the sorted order in which 
they were stored. But the heap sort does a partial sorting by inserting the elements into a heap 
and then finishes the sorting as the elements are removed from the heap. The payoff from this 
mean between the extremes is greater efficiency: O(n lgn) instead of O(n’). 


THE SPEED LIMIT FOR COMPARISON SORTS 


Theorem 14.17 No sorting algorithm that rearranges the array by comparing its elements 
can have a worst-case complexity function better than O(7 lg 7). 


Consider the decision tree that covers all possible outcomes of the algorithm on an array of 
size n. Since the algorithm rearranges the array by comparing its elements, each node in the 
decision tree represents a condition of the form Cali] < a[j]). Each such condition has two 
possible outcomes (true or false), so the decision tree is a binary tree. And since the tree must 
cover all possible arrangements, it must have at least m! leaves. Therefore, by Corollary 11.3 on 
page 203, the height of the decision tree must be at least lg(n!). In the worst case, the number of 
comparisons that the algorithm makes is the same as the height of the decision tree. Therefore, 
the algorithm’s worst-case complexity function must be O(|g(7!)). 


Now by Stirling’s Formula (outlined on page 325), 


n 
n 
nl» Inn (2) 
e 


log(n!) = log( 2nt (2) ) = log(n”) = nlogn 


sO 


(Here, “log” means the binary logarithm log,.) Therefore, the algorithm’s worst-case complexity 
function must be O(n log n). 


Theorem 14.17 applies only to comparison sorts. A comparison sort is an algorithm that sorts 
elements by comparing their values and then changes their relative positions according to the 
outcomes of those comparisons. All the sorting algorithms outlined previously are comparison 
sorts. In contrast, the following sorting algorithms are not comparisons sorts. 
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THE RADIX SORT 


The radix sort assumes that the sequence’s element type is a lexicographic array of constant 
size, that is, either a character string type or an integer type. Let r be the array element’s radix 
(e.g., = 26 for ASCII character strings, r= 10 for decimal integers, r = 2 for bit strings), and let 
w be the constant width of the lexicographic array. For example, for U.S. Social Security 
numbers, d= 10 and w= 9. 


EXAMPLE 14.11 Sorting Books by Their ISBNs 


Every book published since the 1970s has been assigned a unique international standard book number 
(ISBN). These are usually printed at the bottom of the back cover of the book. For example, the ISBN for 
this book is 0071476989. (ISBNs are usually hyphenated, like this: 0-07-147698-9, to distinguish the four 
separate fields that make up the code.) The last digit is a check digit, computed from the other nine digits. 
Since it can be any of the 10 numeric digits or the letter X, we have that the radix r = 11, while the number 
of digits d= 10. 


Algorithm 14.10 The Radix Sort 
(Precondition: s = {s)...s,,} 1S a sequence of n integers or character strings with radix r and 
width w.) 
(Postcondition: The sequence s is sorted numerically or lexicographically.) 
1. Repeat step 2 ford =0uptow-l. 
2. Apply a stable sorting algorithm to the sequence s, sorting only on digit number d. 


A sorting algorithm is said to be stable if it preserves the relative order of elements with equal 
keys. For example, the insertion sort is stable, but the heap sort is not. 


EXAMPLE 14.12 Sorting ISBNs with the radix sort 


Figure 14.5 shows a sequence of 12 ISBNs and the first four iterations of the radix sort applied to it. 


0070308373 0071353461 0071342109 0071342109 
0071353461 0070308373 8838650527 007052713x 
0071342109 0071353453 0830636528 0071361286 
0071353453 0070308683 007052713x 0070308373 
0070308683 8838650454 0071353453 0071353453 
0071361286 9742080585 8838650454 8838650454 
007052713x 0071361286 0071353461 0071353461 
0830636528 8838650527 0070308373 0830628479 
0830628479 0830636528 0830628479 8838650527 
8838650527 0830628479 0070308683 0830636528 
8838650454 0071342109 9742080585 9742080585 
9742080585 007052713x 0071361286 0070308683 


Figure 14.5 Tracing the radix sort 


Note how the stability is needed to conserve the work done by previous iterations. For example, after 
the first iteration, 8838650527 precedes 0830636528 because 7 < 8. Both of these keys have the same 
value 2 in their second least significant digit (digit number d= 1). So on the second iteration, which sorts 
only on digit number 1, these two keys evaluate as being equal. But they should retain their previous 
relative order because 27 < 28. Stability guarantees that they do. 

The columns that have been processed are shaded. So after the third iteration, the right-most 3-digit 
subsequences are sorted: 109 < 13X < 373 < 453. (Note that X stands for the value 10. So 13X numeri- 
cally means 130 + 10 = 140.) 
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EXAMPLE 14.13 The Radix Sort 


This method assumes that the constants RADIX has WIDTH have been defined. For example, for arrays 
of ints: 


1 public static void sort(Cint[] a) { 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 for Cint d = 0; d < WIDTH; d++) { // step 1 

4 sort(a, d); // step 2 

5 } 

6 } 

7 

8 private static void sortCint[] a, int d) { 

9 // POSTCONDITION: a[] is sorted stably on digit d; 
10 int n = a.length; 

11 int[] c = new int[RADIX]; 

12 for Cint ai: a) { 

13 ++c[digit(d,ai)]; // tally the values in a[] 

14 } 

15 for Cint j = 1; j < RADIX; j++) f 

16 c{jl] += c{j-1]; // clj] == num elts in a[] that are <= j 
17 

18 int[] tmp = new int[n]; 

19 for (int i =n - 1; i >= 0; i--) { 

20 tmp[--c[digit(d, a[i])]] = afi]; 

21 } 

22 for Cint i = 0; i < n; i++) 

23 ali] = tmp[i]; 

24 } 

25 

26 private static int digit(Cint d, int x) f{ 

27 // returns digit number d of integer x 

28 // @.g., digit(2, 1234567890) returns 8; 

29 return x/Cint)Math. pow(10,d)%RADIX; 

30 } 


The secondary sorting method at line 8 is called a counting sort or tally sort. 


Theorem 14.18 The radix sort runs in O(7) time. 
The algorithm has WIDTH iterations and processes all n elements on each iteration three times. 
Thus, the running time is proportional to WIDTH*n and is a constant. 


Although O(n) is theoretically better than O(mlgn), the radix sort is rarely faster than the 
O(nlgn) sorting algorithms (merge sort, quick sort, and heap sort). That is because it has a lot of 
overhead extracting digits and copying arrays. 


THE BUCKET SORT 


The bucket sort is another distribution sort. It distributes the elements into “buckets” accord- 
ing to some coarse grain criterion and then applies another sorting algorithm to each bucket. It is 
similar to the quick sort in that all the elements in bucket i are greater than or equal to all the 
elements in bucket i—1 and less than or equal to all the elements in bucket i+1. Whereas quick 
sort partitions the sequence into two buckets, the bucket sort partitions the sequence into n 
buckets. 
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Algorithm 14.11 The Bucket Sort 
(Precondition: s = {sp ...s,.;} is a sequence of 7 ordinal values with known minimum value min 
and maximum value max.) 
(Postcondition: the sequence s is sorted.) 
1. Initialize an array of 1 buckets (collections). 
Repeat step 3 for each s; in the sequence. 
Insert s, into bucket j, where j = Lrn J], r = (s,;— min)/(max + 1 — min). 
Sort each bucket. 
Repeat step 6 for 7 from 0 ton—-1. 
6. Add the elements of bucket j sequentially back into s. 
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EXAMPLE 14.14 Sorting U.S. Social Security Numbers with the Bucket Sort. 


Suppose you have 1000 nine-digit identification numbers. Set up 1000 arrays of type int and then 
distribute the numbers using the formula j = | rn |, r = (s; — min)/(max + 1 — min) = (s,— 0)/(10° + 1 — 0) 
= s,/10°. So, for example, the identification number 666666666 would be inserted into bucket number / 
where j = |rn| = | (666666666/10°)(10°) | = | 666.666666 | = 666. Similarly, identification number 
123456789 would be inserted into bucket number 123, and identification number 666543210 would be 
inserted into bucket 666. (See Figure 14.6.) 

Then each bucket would be sorted. Note that the number of elements in each bucket will average 1, so 
the choice of sorting algorithm will not affect the running time. 

Finally, the elements are copied back into s, starting with bucket number 0. 


000284791 
000550376 


Bucket 0 — 
Bucket 1 000284791] 0 
000550376| 1 
ieee 002846113] 2 
002846113 004479198 | 3 
Bucket 2 004760115 | 4 
004766632] 5 
005503276] 6 
Bucket 3 
004479198 
004760115 
004766632 
Bucket 4 
Bucket 5 
005503276 
Bucket 6 


Figure 14.6 Tracing the bucket sort 
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EXAMPLE 14.15 The Bucket Sort 


1 public static void sort(Cint[] a) { 

2 // POSTCONDITION: a[O] <= a[1] <= ... <= a[a.length-1]; 
3 int min = min(a); 

4 int max = max(a); 

5 int n = a.length; 

6 Bucket[] bucket = new Bucket[n]; // step 1 
7 for Cint j=0; j<n; j++) f{ 

8 bucket[j] = new Bucket(); 

9 

10 for Cint i=0; i<n; i++) f{ // step 2 
1 int j = n*Cali] - min)/(max + 1 - min); 

12 bucket[j].add(a[i]); // step 3 
13 } 

14 int i=0; 

15 for (int j=0; j<n; j++) f{ 

16 Bucket bj=bucket[j]; 

17 bj.sortQ; // step 4 
18 for Cint k=0; k<bj.sizeQ; k++) f{ // step 5 
19 ali++] = bj.getck); 

20 } 

21 } 

22 } 

23 

24 private static int minCint[] a) { 

25 int min = a[0O]; 

26 for (int ai: a) { 

27 if (ai < min) { 

28 min = ai; 

29 } 

30 } 

31 return min; 

32 } 

33 

34 private static int maxCint[] a) { 

35 int max = a[0]; 

36 for Cint ai: a) { 

37 if (ai > max) { 

38 max = al; 

39 } 

40 } 

1 return max; 

42 } 


This program requires the implementation of this interface: 
public interface Bucket { 
public void addCint x); // appends x to end of bucket 


public int getCint k); // returns element k from bucket 
public int size(Q); // returns the number of elements 
} 
For example: 
43 private static class Bucket extends java.util.ArrayList<Integer> { 
44 void sort() { 
45 java.util.Arrays.sort(this.toArray()); 


46 } 
47 } 
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Theorem 14.19 The bucket sort runs in O(7) time. 

The algorithm has three parallel loops, each iterating n times. The last loop has an inner loop, 
but it averages only one iteration. The minimum() and maximum() methods also require n steps 
each. Hence the number of steps executed is proportional to 5n. 


Like the radix sort, the O(n) bucket sort is in practice much slower than the O(7 1g7) sorting 
algorithms because of the substantial overhead costs. 
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Review Questions 


Why is the bubble sort so slow? 


The bubble sort makes n(n — 1)/2 comparisons to sort n elements. How does it follow that its 
complexity function is O(n’)? 


Why are the O(7) sorting algorithms (radix sort and bucket sort) slower than the O(n Ign) 
sorting algorithms (merge sort, quick sort, and heap sort)? 


The merge sort applies the general method, known as divide and conquer, to sort an array. It 
divides the array into pieces and applies itself recursively to each piece. What other sorting 
algorithm(s) use this method? 


Which sorting algorithms work as well on linked lists as on arrays? 

Which sorting algorithms have a different worst case complexity than their average case? 
Which sorting algorithms have a different best case complexity than their average case? 
Why is the nonrecursive version of a recursive sorting algorithm usually more efficient? 
How is the quick sort like the merge sort? 

Under what circumstances would the merge sort be preferred over the other two O(n Ign) 
sorting algorithms? 

Under what circumstances is the quick sort like the selection sort? 

Under what circumstances would the quick sort be preferred over the other two O(v lgn) 
sorting algorithms? 

How is the heap sort similar to the selection sort and the insertion sort? 


Which algorithm does the Java API use to implement its java.util.Arrays.sort(Q 
methods? 


A sorting algorithm is said to be stable if it preserves the order of equal elements. Which of 
the sorting algorithms are not stable? 


Which of the nine sorting algorithms outlined in this chapter require extra array space? 


Which of the nine sorting algorithms outlined in this chapter would work best on an external 
file of records? 


The merge sort is parallelizable. This means that parts of it can be performed simultaneously, 
independent of each other, provided that the computer has multiple processors that can run in 
parallel. This works for the merge sort because several different parts of the array can be sub- 
divided or merged independently of other parts. Which of the other sorting algorithms 
described in this chapter are parallelizable? 
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14.19 Imagine a Web site that has a Java applet for each sorting algorithm that shows how the algo- 


14.1 


14.2 


14.3 


rithm works by displaying an animation of a test run on an array a[] of 256 random numbers 
in the range 0.0 to 1.0. The animation shows on each iteration of the algorithm’s main loop a 
two-dimensional plot of 256 points (x, y), one for each element in the array, where x = i+1 
and y= a[i]. Each plot in Figure 14.7 shows the progress halfway through the sort for one 
of the following six algorithms: 

selection sort 

insertion sort 

merge sort 

quick sort 

heap sort 

radix sort 
Match each plot with the sorting algorithm that produced it: 


Hh 0 
oom 


Figure 14.7 Sorting algorithms in motion 


If an O(n’) algorithm (e.g., the bubble sort, the selection sort, or the insertion sort) takes 3.1 
milliseconds to run on an array of 200 elements, how long would you expect it to take to run 
on a similar array of: 


a. 400 elements? 
b. 40,000 elements? 


Problems 


If an O(n lgn) algorithm (e.g., the merge sort, the quick sort, or the heap sort) takes 3.1 milli- 
seconds to run on an array of 200 elements, how long would you expect it to take to run on a 
similar array of 40,000 elements? 


The insertion sort runs in linear time on an array that is already sorted. How does it do on an 
array that is sorted in reverse order? 
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How does the bubble sort perform on: 
a. An array that is already sorted? 
b. An array that is sorted in reverse order? 


How does the selection sort perform on: 
a. An array that is already sorted? 
b. An array that is sorted in reverse order? 


How does the merge sort perform on: 
a. An array that is already sorted? 
b. An array that is sorted in reverse order? 


How does the quick sort perform on: 
a. An array that is already sorted? 
b. An array that is sorted in reverse order? 


How does the heap sort perform on: 
a. An array that is already sorted? 
b. An array that is sorted in reverse order? 


The bubble sort, the selection sort, and the insertion sort are all O(n’) algorithms. Which is 
the fastest and which is the slowest among them? 


The merge sort, the quick sort, and the heap sort are all O(mlg7) algorithms. Which is the 
fastest and which is the slowest among them? 


Trace by hand the sorting of this array 
int al] = { 44, 88, 55, 99, 66, 33, 22, 88, 77 } 

by each of the following algorithms: 

a. The quick sort 
The heap sort 
The bubble sort 
The selection sort 
The insertion sort 
The merge sort 


mo ao 


Modify the bubble sort so that it sorts the array in descending order. 

Modify the bubble sort so that it is “smart” enough to terminate as soon as the array is sorted. 
Prove Theorem 14.1 on page 257. 

Prove Theorem 14.2 on page 257. 


The shaker sort is the same as the bubble sort except that it alternates “bubbling” up and 
down the array. Implement the shaker sort, and determine whether it is more efficient than 
the straight insertion sort. 


Modify the selection sort (Algorithm 14.2 on page 257) so that it uses the smallest element of 
{s;...S,4} in step 2. 


Rewrite the selection sort recursively. 
Prove Theorem 14.3 on page 258. 
Prove Theorem 14.4 on page 258. 


Modify the insertion sort so that it sorts the array indirectly. This requires a separate index 
array whose values are the indexes of the actual data elements. The indirect sort rearranges 
the index array, leaving the data array unchanged. 
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Rewrite the insertion sort recursively. 
Prove Theorem 14.5 on page 259. 
Prove Theorem 14.6 on page 259. 
Prove Theorem 14.7 on page 259. 


Modify the quick sort so that it selects its pivot as the last element instead of the first element 
of the subsequence. 


Modify the quick sort so that it selects its pivot as the median of the first, middle, and last ele- 
ments. 


Modify the quick sort so that it reverts to the insertion sort when the array size is below 8. 


Since the heap sort runs in O(n 1gn) time, why isn’t it always preferred over the quick sort, 
which runs in O(7’) in the worst case? 


Since the heap sort runs in O(mlgn) time and requires no extra array space, why isn’t it 
always preferred over the merge sort, which requires duplicate array space? 


Prove Theorem 14.15 on page 269. 
Here is the Las Vegas sort, as applied to sorting a deck of cards: 
1. Randomly shuffle the cards. 


2. If the deck is not sorted, repeat step 1. 
Derive the complexity function for this sorting algorithm. 


Answers to Review Questions 


The bubble sort is so slow because it operates only locally. Each element moves only one position at a 
time. For example, the element 99 in Example 14.3 on page 256 is moved by six separate calls to the 
swap () function to be put into its correct position at a[8]. 


The run time is nearly proportional to the number of comparisons made. That number is n(n — 1)/2. For 
every positive integer n, n(n—1)/2 <n’, so n(n—1)/2 = O(n’). Thus, O(n”) is the complexity function. 


The O(n) sorting algorithms (radix sort and bucket sort) are slower than the O(n lgn) sorting algo- 
rithms (merge sort, quick sort, and heap sort) because, although their running time is proportional to 7, 
the constant of proportionality is large because of large overhead. For both the radix sort and the 
bucket sort, each iteration requires copying all the elements into a list of queues or arrays and then 
copying them back. 


The merge sort, quick sort, and bucket sort all use the divide-and-conquer strategy. 


The bubble sort, selection sort, insertion sort, merge sort, and quick sort work as well on linked lists as 
on arrays. 


The quick sort and bucket sort are significantly slower in the worst case. 
The insertion sort, shell sort, and radix sort are significantly faster in the best case. 
Recursion carries the overhead of many recursive method invocations. 


The quick sort implements the divide-and-conquer strategy: first it performs its O(|g7) partitioning of 
the sequence, and then it recursively sorts each of the two pieces independently. The merge sort imple- 
ments the divide-and-conquer strategy but in the reverse order: It makes its two recursive calls first 
before performing its O(lgv) merge. Both algorithms do O(n) amount of work O(|gn) times thus 
obtaining O(7 lgn) complexity. 

The merge sort is best for sorting linked lists and external files. 

The quick sort reverts to the selection sort in the worst case, when the sequence is already sorted. 
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The quick sort is best for sorting large arrays of primitive types. 


The selection sort can be seen as a sort-on-output process: Insert the elements into an array as they are 
given, and then repeatedly select out the next largest element. The insertion sort can be seen as a sort- 
on-input process: Repeatedly insert each element into its correct ordered position in an array, and then 
remove them in their array order. So the selection sort inserts the elements into the array in O(7) time 
and removes them in O(n”), while the insertion sort inserts the elements into the array in O(n’) time 
and removes them in O(n). Both result in an O(n’) algorithm. 

The heap sort can be seen as a partial-sort-on-input-and-partial-sort-on-output process: Insert the 
elements into an array maintaining the (partially sorted) heap property, and then repeatedly select the 
first (which is the smallest) element and restore the heap property. Both the insertion process and the 
removal process have the same O(n lgn) running time, resulting in a total O(7 lg) running time. 

The Java API uses the merge sort to implement its ArrayS.Ssort() methods for arrays of objects, 
and it uses the quick sort to implement its Arrays .Sort() methods for arrays of primitive types. 


The shell sort, quick sort, and heap sort are unstable. 
The merge sort, radix sort, and bucket sort require extra array storage. 


The bubble sort, selection sort, insertion sort, merge sort, and quick sort work as well on external files 
of records. 


The shell sort, merge sort, quick sort, and bucket sort all would run significantly faster on a parallel 
computer. 


Matching the algorithms with their graphical output is shown in Figure 14.8. 


. 


Merge Sort Heap Sort Radix Sort 
Quick Sort Selection Sort Insertion Sort 


Figure 14.8 Sorting algorithms in motion 
Solutions to Problems 


The O(n’) algorithm should take: 

a. 12.4 milliseconds (4 times as long) to run on the 400-element array. 

b. 124 seconds (40,000 times as long) to run on the 40,000-element array. That’s about 2 minutes. 
This answer can be computed algebraically as follows. The running time f is proportional to n’, 
so there is some constant c for which ¢ = c-n’. If it takes t= 3.1 milliseconds to sort n = 200 ele- 
ments, then (3.1 milliseconds) = c-(200 elements)’, so c = (3.1 milliseconds)/(200 elements)* = 
0.0000775 milliseconds/element”. Then, for n = 40,000, t = c-n* = (0.0000775 milliseconds/ 
element*)-(40,000 elements)” = 124,000 milliseconds = 124 seconds. 


The O(nlgn) algorithm should take 1.24 seconds (400 times as long) to run on the 40,000-element 
array. This answer can be computed algebraically. The running time ¢ is proportional to nlgn, so there 
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is some constant c for which ¢ = c-nlgn. If it takes t = 3.1 milliseconds to sort m = 200 elements, then 
(3.1) = c-(200) 1g(200), so c = (3.1 milliseconds)/(200-1g(200)) = 0.0155/lg(200). Then, for n = 40,000, 
t=c-nlg n= (0.0155/lg(200))(40,000-1g(40,000) ) = 620-(1g(40,000)/1g(200)). Now 40,000 = 2007, so 
12(40,000) = 1g(200°) = 2-1g 200. Thus, 1¢(40,000)/1g(200) = 2, so t= 620-2 milliseconds = 1240 milli- 
seconds = 1.24 s. 


The insertion sort has its worst performance on an array that is sorted in reverse order, because each 
new element inserted requires all of the elements on its left to be shifted one position to the right. 


The bubble sort, as implemented in Algorithm 14.1 on page 257, is insensitive to input. That means 
that it will execute the same number n(n—1)/2 of comparisons regardless of the original order of the 
elements in the array. So it doesn’t matter whether the array is already sorted or whether it is sorted in 
reverse order; it is still very slow. 


The selection sort is also insensitive to input: It takes about the same amount of time to sort arrays of 
the same size, regardless of their initial order. 

The merge sort is also insensitive to input: It takes about the same amount of time to sort arrays of the 
same size, regardless of their initial order. 


The quick sort is quite sensitive to input. As implemented in Algorithm 14.6 on page 263, the quick 
sort will degrade into an O(n’) algorithm in the special cases where the array is initially sorted in either 
order. That is because the pivot element will always be an extreme value within its subarray, so the 
partitioning splits the subarray very unevenly, thereby requiring 7 steps instead of lgn. 


The heap sort is a little sensitive to input, but not much. The heapify() function may require fewer 
than lg 7 iterations. 


The bubble sort is slower than the selection sort, and the insertion sort (in most cases) is a little faster. 
The merge sort is slower than the heap sort, and the quick sort (in most cases) is faster. 
a. Trace of the quick sort: 


aLO] | a[1] | al[2] | aL3] | al4] | a[5] | alo] | aL7] | al8] 
44 88 55 99 66 33 22 88 77 


77 99 
55 77 


al0O] | a[1] | a[2] | a[3] | a[4] | a[5] | a[6] | a[7] | al[8] 
44 88 55 99 66 33 22 88 77 
99 88 
99 44 
88 44 
77 99 
88 77 
44 88 
88 77 44 
22 88 
77 66 22 
33 77 
66 44 33 
22 66 
55 22 
33 55 
44 22 
33 44 
22 33 
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c. Trace of the bubble sort: 
aLO] | aL1] | a[2] | a[3] | al4] | aL5] | ale] | al7] | al8] 
44 88 55 99 66 33 22 88 77 
55 88 
66 99 
33 99 
22 99 
88 99 
77 99 
66 88 
33 88 
22 88 
77 88 
77 88 
33 66 
22 66 
33 55 
22 55 
33 44 
22 44 
22 33 
d. Trace of the selection sort: 
aLO] | al1] | a[2] | a[3] | aL4] | aL5] | al6] | al7] | aLl8] 
44 88 55 99 66 33 22 88 77 
22 44 
33 88 
44 55 
55 99 
77 88 
88 99 
e. Trace of the insertion sort: 
aLlO] | a[1] | a[2] | a[3] | a[4] | a[5] | a6] | al7] | a8] 
44 88 55 99 66 33 22 88 77 
55 88 
66 88 99 
33 44 55 66 88 99 
22 33 44 55 66 88 99 
88 99 
77 88 88 99 
f. Trace of the merge sort: 
aLlO] | a[1] | a[2] | a[3] | a[4] | a[5] | a6] | al7] | a8] 
44 88 55 99 66 33 22 88 77 
44 55 77 99 
33 66 
77 88 
22 33 66 77 88 
22 33 44 55 66 77 88 88 99 
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public static void sortCint[] a) { 
for Cint i = a.length-1; i > 0; i--) { 
for Cint j = 0; j < i; j++) f{ 
if (a[j] > a[j+1]) f{ 
swap(a, j, jt+l); 
} 
} 
} 
} 


public static void sortCint[] a) { 
boolean sorted=false; 
int i = a.length-1; 
while (i > O && !sorted) { 
for Cint j = 0; j < i; j++) f{ 
sorted = true; 
if (a[j] > alj+1]) f{ 
swap(a, j, j+1); 
sorted = false; 
} 
} 
--i; 
} 
} 


The loop invariant can be used to prove that the bubble sort does indeed sort the array. After the first 
iteration of the main 7 loop, the largest element must have moved to the last position. Wherever it 
began, it had to be moved step by step all the way to the right, because on each comparison the larger 
element is moved right. For the same reason, the second largest element must have been moved to the 
second-from-last position in the second iteration of the main i loop. So the two largest elements are in 
the correct locations. This reasoning verifies that the loop invariant is true at the end of every iteration 
of the main i loop. But then, after the last iteration, the n-1 largest elements must be in their correct 
locations. That forces the nth largest (i.e., the smallest) element also to be in its correct location, so the 
array must be sorted. 


The complexity function O(n’) means that, for large values of n, the number of loop iterations tends to 
be proportional to n. That means that, if one large array is twice the size of another, it should take 
about four times as long to sort. The inner j loop iterates n—1 times on the first iteration of the outside 
i loop, n—2 times on the second iteration of the i loop, n—3 times on the third iteration of the i loop, 
and so on. For example, when n = 7, there are six comparisons made on the first iteration of the i loop, 
five comparisons made on the second iteration of the i loop, four comparisons made on the third iter- 
ation of the i loop, and so forth, so the total number of comparisons is 6+5+4+3+2+1=21.In 
general, the total number of comparisons will be (n—1) + (n—2) + (n—3) +--+ +34+2+1. This sum 
is n(n—1)/2. (See Theorem A.7 on page 323.) For large values of n, that expression is nearly n’/2 


which is proportional to n?. 


public static void sortCint[] a) { 
boolean sorted=false; 
for Cint i = a.length; i > 0; i -= 2) { 
for Cint j = 1; j < i; j++) f{ 
if (a[j-1] > al[j]) f 
swap(a,j-1,5); 
} 
} 
for Cint j = 1-2; j > 0; j--) { 
if (a[j-1] > al[j]) f 
swap(a, j-1, j); 
} 
} 
} 
} 
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public static void sort(Cint[] a) { 
for Cint i = 0; i < a.length-1; i++) { 
int j=i; 
for Cint k = i4+1; k < a.length; k++) { 
if (a[k] < a[j]) { 
j=k; 
} 


} 


swap(a, i, j); 


} 


public static void sort(Cint[] a) { 
sort(a, a.length); 
} 


private static void sortCint[] a, int n) { 
if (n < 2) { 
return; 
} 
int j = 0; 
for (int k = 1; k < n; k++) { 
if (a[k] > a[j]) f 
j =k; 
} 
} 
swap(a, n-1, j); 
sort(a, n-1); 


} 


The last loop invariant proves correctness. So, like the proof for the bubble sort, we need only verify 
the loop invariants. 

On the first iteration of the main loop (step 1), a[i] is the last element in the array, so the index 
k of the inner loop runs through every element after a[0]. The value of the index j begins at 0 and 
then changes each time k finds a larger element. Since j is always reset to the index of the larger 
element, a[j] will be the largest element of the array when the inner loop finishes. This verifies the 
first loop invariant. On each successive iteration of the outer loop, the index k runs through the 
remaining unsorted segment of the array, so for the same reason, a[j] will be the largest element of 
that remaining segment when the inner loop finishes. This verifies that the first loop invariant is true 
on every iteration of the outer loop. 

Since swap(a,i,j) simply interchanges ali] with a[j], the second loop invariant follows 
from the first. 

The third loop invariant follows from the second and by mathematical induction. During the first 
iteration of the main loop, the inner loop finds a[j] to be the largest element in the array. The 
swap(a,i,j) puts that largest element in the last location a[i], so a[i] must be >= all the a[j]. 
Prior to the ith iteration of the main loop, we have by the inductive hypothesis that the subarray 
aLi+1..n) is sorted and all the values in the subarray a[0..i] are smaller than a[i+1]. Then after 
the ith iteration, a[i] is one of those smaller elements, so ali] < a[i+1] <...<a[n-1]. 


Again, the proof is essentially the same as that for the corresponding theorem for the bubble sort. On 
the first iteration of the outer i loop, the inner j loop iterates n—1 times. On the second, it iterates n — 
2 times. This progression continues, for a total of (n—1) + (n—2) +--+ +2+1=n(n—-1)/2. 


public static void sortCint[] a, int[] ndx) { 


for Cint i = 1; i < a.length; i++) { 
int ndxi = ndx[i], j; 
for (j = i; j > 0 & a[ndx[j-1]] > a[ndxi]; j--) f 
ndx[j] = ndx[j-1]; 


} 
ndx[j] = ndxi; 
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public static void sort(int[] a) { 
sort(a, a.length); 
} 


public static void sortCint[] a, int n) { 
if (n < 2) { 
return; 
} 
sort(a, n-1); 
int temp = a[n-1], j; 
for (j = n-1; j > O && a[j-1] > temp; j--) f{ 
alj] = alj-1]; 
} 
a[j] = temp; 
} 


On the first iteration of the main loop, a[1] is compared with a[0] and interchanged if necessary. So 
aL0] < a[1] after the first iteration. If we assume that the loop invariant is true prior to some kth iter- 
ation, then it must also be true after that iteration because during it a[k] is inserted between the ele- 
ments that are less than or equal to it and those that are greater. It follows by mathematical induction 
that the loop invariant is true for all k. 


ll 


The proof is similar to that for the corresponding theorems for the bubble sort and the selection sort. 
On the first iteration of the outer i loop, the inner j loop iterates once. On the second, it iterates once 
or twice, depending upon whether a[1] > a[2]. On the third iteration, the inner j loop iterates at most 
three times, again depending upon how many of the elements on the left of a[3] are greater than a[3]. 
This pattern continues, so that on the kth iteration of the outer loop, the inner loop iterates at most k 
times. Thus the maximum total number of iterations is 1 +2 +---+(n—1)=n(n—1)/2. 


In this case, the inner loop will iterate only once for each iteration of the outer loop. So the total num- 
ber of iterations of the inner loop is: 1 +1+1+---+1+1=n-1. 


For the quick sort pivoting on the last element, the only changes needed are in the partition() 
method: 
private static int partitionCint[] a, int p, int q) { 
int pivot = a[q-1], i = p-1, j = q-1; 
while Ci < j) f{ 
while Ci < j && a[++i] <= pivot) ; // empty loop 


if G <j) f 
alj] = ali]; 
} 
while (j > 1 && a[--j] >= pivot) ; // empty loop 
if Gj >if 
ali] = aljl]; 
} 
} 
alj] = pivot; 
return j; 


} 


For the quick sort pivoting on median of three elements, the only changes needed are in the 
partition() method: 
private static int partitionCint[] a, int p, int q) { 
int m= (p + q)/2; 
m = indexOfMedian(a, p, m, q-1); 
swap(a, p, m); 
// The rest is the same as lines 20-32 in Example 14.9 on page 264 
} 
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This requires a method for locating the index of three array elements: 
private static int indexOfMedian(int[] a, int i, int j, int k) { 

// Returns the index of the median of {a[i], a[j], a[k]} 

if cali] <= a[j] && a[j] <= a[k]) return j; 

if cali] <= afk] && a[k] <= a[j]) return k; 

if ca[j] <= ali] && afi] <= a[k]) return i; 

if ca[j] <= a[k] && afk] <= a[i]) return k; 

if ca[k] <= ali] && afi] <= a[j]) return i; 

return j; 


} 


14.28 For the quick sort with reversion to the insertion sort on arrays of size < 8, the only changes needed are 


14.29 
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in the sortQ method: 
private static void sortCint[] a, int p, int q) { 
if (qq-p<2) f{ 
return; 
} 
if q-p<8) ft 
insertionSort(a, p, q); 


return; 
} 
int m = partition(a, p, q); 
sort(a, p, m); // steps 2 & 3 
sort(a, m+1, q); // step 4 


} 
This requires a generalization of the insertion sort: 
public static void insertionSortCint[] a, int p, int q) { 
for Cint i = p+1; i < q; i++) { 
int ai = a[i], j; 
for (j = i; j > 0 & a[j-1] > ai; j--) f 
aj] = alj-1]; 
} 
a[j] = ai; 
} 
} 


The heap sort is not always preferred over the quick sort because it is slower in the average case. 
The heap sort is not always preferred over the merge sort because it is not stable. 


The postcondition of heapify (Algorithm 14.9 on page 266) establishes the loop invariant in step 3. 
That guarantees that the root sp is the maximum element of the subsequence. Step 5 inserts that maxi- 
mum at the end of the subsequence. So when the loop at step 4 is finished, the sequence will be sorted. 
The heapify algorithm restores the heap property to the complete segment ss by applying the 
heapifyDown() method from its root. 


The Las Vegas sort has complexity O(n”). 


There are n! different permutations of a deck of n cards. Shuffling them is equivalent to selecting 
one permutation at random. Only one of the 7! permutations leaves the cards in order. So the expected 
number of random shuffles required before the correct one occurs is m!/2. Then each permutation takes 
n—1 comparisons to see if it is the correct one. So the total complexity is O(mn!/2). By Stirling’s 
Formula (page 325), O(n n!/2) = O(n!) = O(2”). 


Graphs 


A graph is a nonlinear structure. Like linear data structures (lists), it can be implemented with 
either an indexed or a linked backing data structure. 


SIMPLE GRAPHS 


A (simple) graph is a pair G = (V, E), where V and E are finite sets and every element of F is a 
two-element subset of V (i.e., an unordered pair of distinct elements of V). The elements of V are 
called vertices (or nodes), and the elements of EF are called edges (or arcs). If e€E, then e = {a, b} 
for some a, b € V. In this case, we can denote e more simply as e = ab = ba. We say that the edge 
e connects the two vertices a and b, that e is incident with a and b, that a and b are incident upon 
e, that a and b are the terminal points or end points of the edge e, and that a and b are adjacent. 

The size of a graph is the number of elements in its vertex set. 


EXAMPLE 15.1 A Simple Graph a b 


Figure 15.1 shows a simple graph (V, £) of size 4. Its vertex set is 
V= {a, b, c, d}, and its edge set is E = {ab, ac, ad, bd, cd‘. This c d 


graph has four vertices and five edges. Figuré 15:1" A graph 


Note that by definition an edge is a set with exactly two 
elements. This prevents the possibility of a loop being an edge because a loop involves only one 
vertex. So the definition of simple graphs excludes the possibility of loops. 

Also note that since F is a set, an edge cannot be listed more than once. (Sets do not allow 
repeated members.) So the definition of simple graphs excludes the possibility of multiple edges. 

In general, graphs may include loops and multiple edges; simple graphs do not. 


GRAPH TERMINOLOGY 


If G=(¥/, £) isa graph and G’ = (’, E’) where V” € Vand EF’ C E£, then G’ is called a subgraph 
of G. If V = V, then G' is called a spanning subgraph of G. 
Every graph is a spanning subgraph of itself. 
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EXAMPLE 15.2 Subgraphs G, 


The graph G, = (V,, £,) in Figure 15.2 with vertex set V, = {a, b, d} 
and edge set EF’, = {ad, bd} is a nonspanning subgraph of the graph in d 
Example 15.1. This subgraph has size 3. 

The graph G, = (V,, E>) in Figure 15.2 with vertex set V, = {a, b, c, a} 
and edge set FE, = {ab, ac, cd} is a spanning subgraph of the graph in 
Example 15.1. This subgraph has size 4. 


c 1 
The degree (or valence) of a vertex is the number of edges that : 


are incident upon it. For example, in the graph in Figure 15.1 on 

page 285, vertex a has degree 3 and vertex b has degree 2. But in 

the subgraph G, in Figure 15.2, vertices a and b both have degree 1. 
An isolated point is a vertex of degree 0. 


Figure 15.2 Subgraphs 


Theorem 15.1 The sum of the degrees of the vertices of a graph with m edges is 2m. 
Each edge contributes | to the degree of each of the two vertices that determine it. So the total 
contribution if m edges is 2m. 


A complete graph is a simple graph in which every pair of vertices is connected by an edge. 
For a given number n of vertices, there is only one complete graph of that size, so we refer to the 
complete graph on a given vertex set. 


EXAMPLE 15.3 The Complete Graph on a Set of Four Vertices a b 


Figure 15.3 shows the complete graph on the set V = {a, b, c, d}. Its 
edge set Fis E= {ab, ac, ad, bc, bd, cd}. c d 

Note that the graphs in the previous examples are subgraphs of this one. Pigore153A-compiets 
Theorem 15.2 The number of edges in the complete graph on z vertices is n(m—1)/2. 

There are 7 vertices, and each of them could be adjacent to m—1 other vertices. So there are 
n(n—1) ordered pairs of vertices. Therefore, there are n(n—1)/2 unordered pairs because each 
unordered pair could be ordered in two ways. For example, the unordered pair {a, b} can be 
ordered as either (a, b) or (b, a). 


For example, the number of edges in the complete graph on the four-vertex set in Example 
15.3 is n(n—1)/2 = 4(4-1)/2 = 6. 


Corollary 15.1 The number of edges in any graph on 7 vertices ism < n(n-1)/2. 


PATHS AND CYCLES 


A walk from vertex a to vertex b in a graph is a sequence of edges (aja), @,d, . . . » AeA) 
where a) = a and a, = b, that is, a sequence of edges (e,, @, ..., e,) where if edge e; connects 
some vertex to vertex a, then the next edge e,,, connects that vertex a, to some other vertex, 
thereby forming a chain of connected vertices from a to b. The /ength of a walk is the number k 
of edges that form the walk. 


CHAP. 15] GRAPHS 287 


Although a walk is a sequence of edges, it naturally induces a sequence of adjacent vertices 


which the edges connect. So we may denote the walk (aya,, a;a, .. . , @,a,) more simply by 
A)Q,a,... A,a, as long as each pair (a;,, a;) is a valid edge in the graph. 
If p = aya,a)... Ga, 1s a Walk in a graph, then we refer to p as a walk from a to a, (or from 


a; to dy), and we say that p connects ay to a, and that a, and a, are connected by p. We also refer to 
a, and a, as the terminal points or the end points of the walk. 
A path is a walk whose vertices are all distinct. 


EXAMPLE 15.4 Graph Paths 


In the graph in Figure 15.4, abcfde is a path of length 
5. It is, more formally, the path (ab, bc, cf, fd, de). 
The walk abefdbc of length 6 is not a path because 


vertex b appears twice. The walk abefa of length 4 is . 7 
also not a path. 
The sequence abf is not a walk because bf is not an 
edge. And the sequence abb is not a walk because bb is 
not an edge. 


Finally, aba is a walk of length 2, and ab is a walk of 
length 1. 


A graph is said to be connected if every pair of f 

its vertices are connected by some path. A graph 

that is not connected is called disconnected. Figure 15.4 Paths in graphs 
All the graphs in the previous examples are 

connected. 


EXAMPLE 15.5 A Disconnected Graph ——_ 
Figure 15.5 shows a graph of size 12 that is dx 
not connected. e o———+ 


A connected component of a graph is a Figure 15.5 A disconnected graph 
subgraph that is maximally connected, that 
is, a connected subgraph with the property 
that any larger subgraph that contains it is disconnected. 

The graph in Example 15.5 has five connected components, of sizes 3, 1, 4, 2, and 2. 


Theorem 15.3 Every graph is a union of a unique set of connected components. 


A walk is closed if its two end points are the same vertex. A cycle is a closed walk of length at 
least 3 whose interior vertices are all distinct. 


EXAMPLE 15.6 Graph Cycles 


In the graph shown in Figure 15.4: 
¢ The walk abefa is a cycle. 
¢ The walk abedbcfa is not a cycle because it is not a path: It has the duplicate internal 
vertex b. 
¢ The path abef is not a cycle because it is not closed. 
¢ And the walk aba is not a cycle because its length is only 2. 
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A graph is said to be acyclic if it contains no cycles. 

Among the graphs shown above, only the ones in Figure 15.2 on page 286 are acyclic. 

An acyclic graph is also called a free forest, and a connected acyclic graph is also called a free 
tree. Note that a tree is the same as a free tree in which one node has been designated as the root. 
So in the context of graph theory, a tree is called a rooted tree, which is defined to be a connected 
acyclic graph with one distinguished node. 

A spanning tree of a graph is a connected acyclic spanning subgraph. 


EXAMPLE 15.7 Spanning Trees 


Figure 15.6 shows a graph and a spanning tree for it. 


Figure 15.6 The graph on the right is a spanning tree of the graph on the left 


ISOMORPHIC GRAPHS 


An isomorphism between two graphs G = (V, £) and G' = (V’, E’) is a function / that assigns to 
each vertex x in V some vertex vy = f(x) in V’ so that the following three conditions are satisfied: 
1. fis one-to-one: Each x in V gets assigned a different y = f(x) in V’. 
2. fis onto: Every yin V’ gets assigned to some x in V. 
3. fpreserves adjacency: If {x,, x,} is an edge in F, then {f(x,), f(x,)} is an edge in EF’. 
Two graphs are said to be isomorphic if there is an isomorphism from one to the other. The 
word isomorphic means “same form.” When applied to graphs, it means that they have the same 
topological structure. Graphically, two graphs are isomorphic if one can be twisted around to the 
same shape as the other without breaking any of the edge connections. 


EXAMPLE 15.8 Isomorphic Graphs 


The two graphs in Figure 15.7 on page 289 are isomorphic. The isomorphism is indicated by the corre- 
sponding vertex labels. 

It can be verified that if vertex x, is adjacent to vertex x, in one graph, then the corresponding vertices 
are adjacent in the other graph. For example, vertex a is adjacent to vertices b, d, e, and f (but not c, g, 
or /) in both graphs. 


To prove that two graphs are isomorphic (by definition), it is necessary to find an isomorphism 
between them. This is equivalent to labeling both graphs with the same set of labels so that 
adjacency applies equally to both labelings. Finding such an isomorphism by chance is unlikely 
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a b 


g a 
Figure 15.7 Isomorphic graphs 


because there are n! different possibilities. For example, there are 8! = 40,320 different possible 
ways to assign the 8 labels to the 8 vertices of each graph in Example 15.8. The following 
algorithm is more efficient: 
1. Arbitrarily label the vertices of one graph. (Assume here that the positive 
integers are used for labels.) 
2. Find a vertex on the second graph that has the same degree as vertex | on 
the first graph, and number that vertex 1 also. 
3. Label the vertices that are adjacent to the new vertex | with the same num- 
bers that correspond to the vertices that are adjacent to the other vertex 1. 
4. Repeat step 3 for each of the other newly labeled vertices. 
If at some point in the process, step 3 is not possible, then backtrack and try a different labeling. 
If no amount of backtracking seems to help, try proving that the two graphs are not isomorphic. 
To prove that two graphs are not isomorphic (by definition) would require showing that every 
one of the possible 7! different labellings fails to preserve adjacency. That is impractical. The 
following theorem makes it much easier to prove that two graphs are not isomorphic. 


Theorem 15.4 Isomorphism Tests for Graphs 

All of the following conditions are necessary for two graphs to be isomorphic: 

They must have the same number of vertices. 

They must have the same number of edges. 

They must have the same number of connected components. 
They must have the same number of vertices of each degree. 
5. They must have the same number of cycles of each length. 


ee 


EXAMPLE 15.9 Proving that Two Graphs Are Not Isomorphic 


Figure 15.8 shows three graphs, to be compared with the two isomorphic graphs in Figure 15.7. Each 
of these graphs has eight vertices, so each could be isomorphic to those two graphs. 

Graph G, is not isomorphic to those two graphs because it has only 14 edges. The graphs in Figure 15.7 
each have 16 edges. Condition 2 of Theorem 15.4 says that isomorphic graphs must have the same 
number of edges. 

Graph G, does have 16 edges. But it is not isomorphic to the two graphs in Figure 15.7 because it has 
two connected components. Each of the two graphs in Figure 15.7 has only one connected component. 
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G, G, a3 G; XK 


Figure 15.8 Possibly isomorphic graphs 


Condition 3 of Theorem 15.5 says that isomorphic graphs must have the same number of connected 
components. 

Graph G; has 16 edges and only one connected component. But it is still not isomorphic to the two 
graphs in Figure 15.7 because it has some vertices of degree 3 (and some of degree 5). All the vertices of 
the two graphs in Figure 15.7 have degree 4. Condition 4 of Theorem 15.5 says that isomorphic graphs 
must have the same number of vertices of each degree. 


Note that in Example 15.9 we really only have to compare each graph with one of the two 
graphs in Figure 15.7 on page 289, not both of them. 


Theorem 15.5 Graph Isomorphism Is an Equivalence Relation 
The isomorphism relation among graphs satisfies the three properties of an equivalence relation: 
1. Every graph is isomorphic to itself. 
2. If G, is isomorphic to G, then G, is isomorphic to G,. 
3. If G, is isomorphic to G, and G, is isomorphic to G;, then G, is isomorphic 
to G;. 


THE ADJACENCY MATRIX FOR A GRAPH 


An adjacency matrix for a graph (V, E) is a two- 4 bh a_bec id 
dimensional boolean array alFi[|rail@rl|t 
boolean[][] a; ple ese ee 
obtained by ordering the vertices V= {v, vj, ..., Vii} c d 
and then assigning true to a[i][j] if and only if 3) Mia ei lea (es 
vertex v; 18 adjacent to vertex v,. djt|ti|trlr 


EXAMPLE 15.10 An Adjacency Matrix 


Figure 15.9 An adjacency matrix 


Figure 15.9 shows the adjacency matrix for the graph 
in Figure 15.1 on page 285. 


Note the following facts about adjacency matrices: 
1. The matrix is symmetric, that is, a[i][j] == a[lj][i] will be true for all i 
and j 
2. The number of true entries is twice the number of edges. 
3. Different orderings of the vertex set V will result in different adjacency 
matrices for the same graph. 
Adjacency matrices are often expressed with Os and Is instead of trues and falses. In that 
form, the adjacency matrix for Figure 15.1 would be the one shown in Figure 15.10 on page 291. 
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THE INCIDENCE MATRIX FOR A GRAPH 


An incidence matrix for a graph (V, E) is a two- 
dimensional array 
int[][] a; 
obtained by ordering the vertices V= {v, v,..., Vii} 
and the edges F = {é), €,,..., €,,} and then assigning 
1 to aLi] [j] if vertex v, is incident upon edge e, and 0 
otherwise. 


EXAMPLE 15.11 An Incidence Matrix 


Figure 15.11 shows the incidence 
matrix for the graph in Figure 15.1 on page 
285. The first row indicates that vertex a is 
incident upon edges 1, 2, and 3; the second 
row indicates that vertex 5 is incident upon 
edges | and 4, and so forth. 


a 


Note that for simple graphs, no 
matter how many vertices and edges 
they have, there will always be exactly 
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abe d 
a}o;}1]1]1 


Figure 15.10 An adjacency matrix 


b al1i/1/1/0/0 
b}1/0/o0;1/0 
d cl011/0/01 4 


d0011 1 


Figure 15.11 A graph and its incidence matrix 


two Is in each column of any incidence matrix. Why? (See Review Question 15.9 on page 305.) 


THE ADJACENCY LIST FOR A GRAPH 


An adjacency list (or adjacency structure) for a 
graph (V, £) is a list that contains one element for each 
vertex in the graph and in which each vertex list 
element contains a list of the vertices that are adjacent 
to its vertex. The secondary list for each vertex is 
called its edge list. 


EXAMPLE 15.12 An Adjacency List 


Figure 15.12 shows the adjacency list for the graph in 
Figure 15.1 on page 285. 

The edge list for vertex a has three elements, one for 
each of the three edges that are incident with a; the edge list 
for vertex b has two elements, one for each of the two edges 
that are incident with b; and so on. 


Note that each edge list element corresponds to a 
unique 1 entry in the graph’s corresponding incidence 
matrix. For example, the three elements in the edge list 
for vertex a correspond to the three 1s in the first row 
(the row for vertex a) in the incidence matrix in Figure 
15.11. 


The 
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Figure 15.12 An adjacency list 


292 GRAPHS [CHAP. 15 


Also note that the edge lists are not ordered, that is, their order is irrelevant. 


DIGRAPHS 


A digraph (or directed graph) is a pair G = (V, E) where V is a finite set and EF is a set of 
ordered pairs of elements of V. As with (undirected) graphs, the elements of V are called vertices 
(or nodes) and the elements of EF are called edges (or arcs). If e € E, then e = (a, b) for some a, 
b€ V. In this case, we can denote e more simply as e = ab. We say that the edge e emanates from 
(or is incident from) vertex a and terminates at (or is incident to) vertex b. The outdegree of a 
vertex is the number of edges that emanate from it. The indegree of a vertex is the number of 
edges that terminate at it. 

Note that, unlike the graph definition, the digraph definition naturally allows an edge to termi- 
nate at the same vertex from which it emanates. Such an edge is called a Joop. A simple digraph 
is a digraph that has no loops. 


EXAMPLE 15.13 A Digraph 


a b 
Figure 15.13 shows a digraph with vertex set V= {a, b, c, d} | 
and edge set E = {ab, ad, bd, ca, dc}. 4 d 
Vertex a has outdegree 2 and indegree 1. Vertices b and c 
each have outdegree 1 and indegree 1. Vertex d has outdegree Figure 15.13 A digraph 


1 and indegree 2. 


Theorem 15.6 If Gis a digraph with m edges, then the sum of all outdegrees equals m and 
the sum of all indegrees equals m. 

Each edge contributes | to the total of all outdegrees and | to the total of all indegrees. So each 
total must be m. 


The complete digraph a the digraph that has a (directed) edge from every vertex to every other 
vertex. 


EXAMPLE 15.14 The Complete Digraph on Six Vertices 


The graph shown in Figure 15.14 is the complete digraph on six 
vertices. It has 15 double-directed edges, so the total number of 
(one-way) edges is 30, which is n(n—1) = 6(6—1) = 6(5) = 30. 


Theorem 15.7 The number of edges in the complete 
digraph on 7 vertices is n(n—-1). 
By Theorem 15.2 on page 286, there are n(n—1)/2 
undirected edges on the corresponding complete undirected 
graph. That makes n(n—1)/2 double-directed edges, so the 
total number of (one-way) directed edges must be twice that Figure 15.14 A complete digraph 
number. 


Corollary 15.2 The number of edges in any digraph on n vertices is m < n(n-1). 
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Every digraph has an embedded graph, obtained by converting each directed edge into an 
undirected edge and then removing duplicate edges and loops. Mathematically, this amounts to 
converting each ordered pair (x, y) of vertices in £ into the set {x, y} and then removing all sets 
of size one (i.e., singletons). 


EXAMPLE 15.15 The Embedded Graph of a Digraph a b 


The embedded graph of the digraph in Figure 15.13 is the 
graph shown in Figure 15.15. c d 


An adjacency matrix for a digraph (V, E) is a two-dimen- Bigare ts 5: amembedded raph 


sional boolean array 

boolean[][] a; 
obtained by ordering the vertices V = {v, v,, ..., v,.,} and then assigning true to ali] [j] if 
and only if there exists an edge emanating from vertex v, and terminating at vertex v,. 


EXAMPLE 15.16 An Adjacency Matrix for a Digraph 


GD 6d 
Figure 15.16 shows the adjacency matrix for the graph in a} F/T/)F/T 
Figure 15.13 on page 292. blr lrlele 
Note that the number of true entries in an adjacency Ole Pera es 
matrix for a digraph is equal to the number of edges. Also, d|F|F|T|F 
as with undirected graphs, different orderings of the vertex 
set V will result in different adjacency matrices for the Figure 15.16 An adjacent matrix 


same digraph. 


An incidence matrix for a digraph (V, £) is a two-dimensional integer array 
int[][] a; 
obtained by ordering the vertices V= {v, v,,..., V,.} and the edges F = {@), e,..., @,,} and 
then assigning 1 to ati] [j] and—1 to a[j] [i] if there exists an edge emanating from vertex v, 
and terminating at vertex v,, and assigning 0 everywhere else. 


EXAMPLE 15.17 An Incidence Matrix for a Digraph 


Figure 15.17 shows an incidence matrix for the 


digraph in Figure 15.13 on page 292. a b a} 1]1]0]-1] 0 
The first row indicates that two edges emanate | 2 F pl|-alo]a|o | o 
from vertex a and one edge terminates there. 
The last 1 is in the row for vertex d and the last (eet RPO) Oh ar ee 
column. The only other nonzero entry in that d|o|-1}-1] 0] 1 


column is the -1 in the row for vertex c, meaning 


that this edge emanates from vertex d and termi- 
nates at vertex c. Figure 15.17 An incidence matrix 


An adjacency list for a digraph (V, £) is a list that contains one element for each vertex in the 
graph and in which each vertex list element contains a list of the edges that emanate from that 
vertex. This is the same as the adjacency list for a graph, except that the links are not duplicated 
unless there are edges going both ways between a pair of vertices. 
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EXAMPLE 15.18 An Adjacency List for a Digraph 


Figure 15.18 shows the adjacency list for the digraph in 
Figure 15.13 on page 292. The edge list for vertex a has two 
elements, one for each of the two edges that emanate from a: 
ab and ad. 


PATHS IN A DIGRAPH 


A walk from vertex a to vertex b in a digraph is a 
sequence of edges (a)@,, Aa, ..., a,,a,) where a, =a 
and a, = b. As with undirected paths in an undirected 
graphs, directed paths are usually abbreviated by their 
vertex string: p = a@a,a, ... a,,a,. Either way, we say 
that the path emanates from (or starts at) vertex a and 
terminates at (or ends at) vertex b. 

A walk is closed if it terminates at the same vertex from which it emanates. A path is a walk 
with all distinct vertices. A cycle is a closed walk with all interval vertices distinct. 


Figure 15.18 An adjacency list 


EXAMPLE 15.19 Directed Paths 


In the digraph of Figure 15.13, adcabdc is a walk of length 6 which is not closed. The walk abdcacda is 
closed, but it is not a cycle because d (and c) are repeated internal vertices. The walk dcab is a path, which 
is not closed. The walk cabdc is a cycle of length 4, and the walk dcad is a cycle of length 3. 


Note that different cycles may traverse the same vertices. For example, adca and cadc are 
different cycles in the digraph in Figure 15.13. 

A digraph is strongly connected if there is a path between every pair of vertices. A digraph is 
weakly connected if its embedded graph is connected. A digraph that is not weakly connected is 
said to be disconnected. 


EXAMPLE 15.20 Strongly Connected and Weakly Connected Digraphs 


In Figure 15.19, digraph G, is strongly connected (and therefore also weakly connected). Digraph G; is 
weakly connected, but not strongly connected because there is no path that terminates at vertex x. Digraph 
G; is disconnected. 


a 
Ua 


Figure 15.19 Strongly connected and weakly connected components 


bP) 
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WEIGHTED DIGRAPHS AND GRAPHS 


A weighted digraph is a pair (V, w) where V is a finite set of vertices and w is a function that 
assigns to each pair (x, y) of vertices either a positive integer or © (infinity). The function w is 
called the weight function, and its value w(x, y) can be interpreted as the cost (or time or distance) 
for moving directly from x to y. The value w(x, y) = © indicates that there is no edge from x to y. 

A weighted graph is a weighted digraph (V,w) whose weight function w is symmetric, that is, 
w(x) = w(x,y) for all x,yeV Just as every digraph has an embedded graph, every weighted 
digraph has an embedded weighted graph (V, w) and an embedded (unweighted) digraph. The 
weight function for the embedded weighted graph can be defined as w'(x, vy) = min{w(x,y), 
w(y,x)}, where w is the weight function of the weighted digraph. The vertex set for the embedded 
digraph can be defined as E = {(x,y) : w(x,y) < ©}. 

The properties described above for digraphs and graphs apply to weighted digraphs and 
weighted graphs. In addition there are some extended properties that depend upon the underlying 
weight function in the obvious manner. For example, the weighted path length is the sum of the 
weights of the edges along the path. And the shortest distance from x to y would be the minimum 
weighted path length among all the paths from x to y. 


EXAMPLE 15.21 A Weighted Digraph and Its Embedded Structures 


Figure 15.20 shows a weighted digraph together with its embedded weighted graph, its embedded 
digraph, and its embedded graph. The weights are shown on the edges. 


Gi a 3 b G, a 3 b G; a b Gy a b 
Cc i d i d d d 


c c 


A weighted Its embedded Its embedded Its embedded 
digraph weighted graph digraph graph 


Figure 15.20 Embedded graphs 


In graph G, the weighted path length of the path cabd is |cabd| = 2 + 3 + 2 =7, and the shortest distance 
from c to d is 6 (along the path cad). But in graph G, that shortest distance is 1 (along the path cd). 

Note that graph G; is the same as that in Example 15.13 on page 292, and graph G, is the same as that 
in Example 15.1 on page 285. 

Figure 15.21 shows the adjacency matrix, the incidence matrix, and the adjacency list for graph G;. 


EULER PATHS AND HAMILTONIAN CYCLES 


An euler path in a graph is a walk that includes each edge exactly once. An euler cycle is a 
closed walk that includes each edge exactly once. An eulerian graph is a graph that has an euler 
cycle. 

Note that euler paths and cycles need not have distinct vertices, so they are not strict paths. 


EXAMPLE 15.22 Euler Paths and Cycles 


In the graph in Figure 15.22, the closed walk acedabefdbcfa is an euler cycle. So this is an eulerian 
graph. Note that every vertex in this graph has degree 4, and its 12 edges are partitioned into three circles. 
As the Theorem 15.8 reports, each of these two properties will always guarantee that the graph is eulerian. 


a 
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Figure 15.21 Adjacency matrix, incidence matrix, and adjacency list 


Theorem 15.8 Eulerian Graphs 
If G is a connected graph, then the following 
conditions are equivalent: 
1. Gis eulerian. 
2. The degree of each vertex is even. 
3. The set of all edges of G can be 
partitioned into cycles. 


A hamiltonian path in a graph is a path that 
includes each vertex exactly once. A hamiltonian 
cycle is a cycle that includes each vertex exactly 
once. A hamiltonian graph is a graph that has a 
hamiltonian cycle. 

Unfortunately, there is no simple characteriza- 
tion like Theorem 15.8 for hamiltonian graphs. In 
fact, the problem of finding such a simple charac- 
terization is one of the big unsolved problems in 
computer science. 


EXAMPLE 15.23 Hamiltonian Graphs 


oo} 


Figure 15.22 An eulerian graph 


In Figure 15.23, the graph on the left is hamiltonian. The graph on the right is not; it has a hamiltonian 


path, but no hamiltonian cycle. 


NN NN 


Figure 15.23 A graph with a hamiltonian cycle, and one with without one 
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DIJKSTRA’S ALGORITHM 


Dijkstra’ algorithm finds the shortest path from one vertex v, to each other vertex in a 
digraph. When it has finished, the length of the shortest distance from vy to v is stored in the 
vertex v, and the shortest path from vy, to v is recorded in the back pointers of v and the other 
vertices along that path. (See Example 15.24.) The algorithm uses a priority queue, initializing it 
with all the vertices and then dequeueing one vertex on each iteration. 


Algorithm 15.1 Dijkstra’s Shortest Paths Algorithm 
(Precondition: G = (V,w) is a weighted graph with initial vertex v).) 
(Postcondition: Each vertex v in V stores the shortest distance from v, to v and a back reference to 
the preceding vertex along that shortest path.) 
1. Initialize the distance field to 0 for v) and to © for each of the other vertices. 
2. Enqueue all the vertices into a priority queue QO with highest priority being 
the lowest distance field value. 
Repeat steps 4-10 until O is empty. 
4. (Invariant: The distance and back reference fields of every vertex that is not 
in Q are correct.) 
5. Dequeue the highest priority vertex into x. 
6. Do steps 7-10 for each vertex y that is adjacent to x and in the priority 


ies) 


queue. 

7. Let s be the sum of the x’s distance field plus the weight of the edge from x 
to y. 

8. Ifs is less than y’s distance field, do steps 9-10; otherwise go back to Step 
3 


9. Assign s to y’s distance field. 
10. Assign x to y’s back reference field. 


EXAMPLE 15.24 Tracing Dijkstra’s Algorithm 


This is a trace of Algorithm 15.1 on a graph with eight vertices. On each iteration, the vertices that are 
still in the priority queue are shaded, and vertex x is labeled. The distance fields for each vertex are shown 
adjacent to the vertex, and the back pointers are drawn as arrows. 


Figure 15.24 The first iteration of Dijkstra’s algorithm 


The first two iterations are shown in Figure 15.24. On the first iteration, the highest priority vertex is 
x =A because its distance field is 0 and all the others are infinity. Steps 7—10 iterate three times, once for 
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each of A’s neighbors y = B, C, and D. The values of s computed for these are 0 + 4 = 4, 0 + 6 = 6, and 
0+ 1=1. Each of these is less than the current (infinite) value of the corresponding distance field, so all 
three of those values are assigned, and the back pointers for all three neighbors are set to point to A. 


On the second iteration, the highest priority vertex among those still in the priority queue is x = D with 
distance field 1. Steps 7-10 iterate three times again, once for each of D’s unvisited neighbors: y = B, F, 
and G. The values of s computed for these are 1 + 4=5, 1 + 2 =3, and 1 + 6 = 7, respectively. Each of 
these is less than the current value of the corresponding distance field, so all of those values are assigned 
and the back pointers are set to D. Note how this changes the distance field and pointer in vertex C. 


©) 


7 


a 


Figure 15.25 The second and third iterations of Dijkstra’s algorithm 


The next two iterations are shown in Figure 15.25. On the third iteration, the highest priority vertex 
among those still in the priority queue is x = F with distance field 3. Steps 7—10 iterate three times again, 
once for each of F’s unvisited neighbors y = C, G, and H. The values of s computed for these are 3 + 1 = 4, 
3 +3 =6, and 3 + 5 = 8. Each of these is less than the current value, so all of them are assigned and the 
back pointers are set to F. Note how this changes the distance field and pointer in vertex C again. 


Figure 15.26 The fourth and fifth iterations of Dijkstra’s algorithm 


On the fourth iteration, shown in Figure 15.26, the highest priority vertex among those still in the prior- 
ity queue is x = B with distance field 4. Steps 7-10 iterate twice, for y = C and E. The values of s 
computed for these are 4 ++ 3 = 7 and 4+ 5 =9. The second of these is less than the current (infinite) value 
at E, so its distance field assigned the value 9 and its back pointer is set to B. But the s value 7 is not less 
than the current distance field for C, so its fields do not change. 


The algorithm progresses through its remaining iterations, shown in Figure 15.27, for x = C, E, G, and 
finally H the same way. 
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Figure 15.27 The sixth and seventh iterations of Dijkstra’s algorithm 


The final result is shown in Figure 15.28. It shows, for example, that the shortest path from A to E is 
ADFCE with length 6. 


Figure 15.28 The last two iterations of Dijkstra’s algorithm 


EXAMPLE 15.25 An Implementation of Dijkstra’s Algorithm 


Here is a Java implementation of Algorithm 15.1. It defines a Network class whose instances represent 


weighted digraphs. 

1 public class Network { 
2 Vertex start; 

3 

4 private class Vertex { 
5 Object object; 

6 Edge edges; 

7 Vertex nextVertex; 
8 boolean done; 

9 int dist; 

10 Vertex back; 

11 } 

12 

13 private class Edge { 
14 Vertex to; 

15 int weight; 

16 Edge nextEdge; 


17 } 
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19 public NetworkQ { 

20 if (start != null) { 

21 start.dist = 0; 

22 for (Vertex p = start.nextVertex; p != null; p = p.nextVertex) { 
23 p.dist = Integer.MAX_VALUE; // infinity 

24 } 

25 } 

26 } 

27 

28 public void findShortestPaths() { // implements Dijkstra’s Algorithm: 
29 for (Vertex v = start; v != null; v = closestVertex()) { 
30 for (Edge e = v.edges; e != null; e = e.nextEdge) { 

31 Vertex w = e.to; 

32 if C!w.done && v.dist+e.weight < w.dist) { 

33 w.dist = v.dist+e.weight; 

34 w.back = v; 

35 } 

36 } 

37 v.done = true; 

38 } 

39 } 

40 

a private Vertex closestVertex() f{ 

42 // returns the vertex with minimal dist among those not done: 
43 Vertex v = null; 

44 int minDist = Integer .MAX_VALUE; 

45 for (Vertex w = start; w != null; w = nextVertex) { 

46 if C!w.done && w.dist < minDist) { 

47 V=W; 

48 minDist = w.dist; 

49 } 

50 } 

51 return v; 

52 } 

53 } 


In this implementation, we have used a simple search method closestVertex() instead of a priority 
queue. This is less efficient, running in O(”) time instead of the O(/gn) time that a priority queue would 
use. 


GRAPH TRAVERSAL ALGORITHMS 


The paths produced by Dijkstra’s algorithm produce a minimal spanning tree for the graph. 
That is a spanning tree whose total weighted length is minimal for the graph; that is, no other 
spanning tree has a smaller total length. The spanning tree is formed in a breadth-first manner, by 
considering the vertices that are adjacent to the current vertex on each iteration. This is one of 
two general ways to traverse a graph. 

The breadth-first search algorithm is essentially the same as Dikstra’s algorithm without 
regard to the distance fields. 


Algorithm 15.2 The Breadth-First Search (BFS) Algorithm 

(Preconditions: G = (VE) is a graph or digraph with initial vertex v); each vertex has a boolean 
visited field initialized to false; 7 is an empty set of edges; L is an empty list of vertices.) 
(Postcondition: L lists the vertices in BFS order, and Tis a BFS spanning tree for G.) 
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EXAMPLE 15.26 Tracing the BFS Algorithm 
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Initialize an empty queue QO for temporary storage of vertices. 
Enqueue vy into Q. 

Repeat steps 4—6 while Q is not empty. 
Dequeue Q into x. 

Add x to L. 

Do step 7 for each vertex y that is adjacent to x. 
If y has not been visited, do steps 8-9. 

Add the edge xy to T. 

Enqueue y into Q. 


Table 15.1 shows a trace of Algorithm 15.2 on the graph shown Figure 15.29 Tracing the BFS 
in Figure 15.29. The start vertex is Vy) =A. 
Q x L y T 
A A A B AB 
B E AB, AE 
B,E B Cc AB, AE, BC 
E,C A,B F AB, AE, BC, BF 
E,C,F E A,B, E 
C,F Cc A, B, E, C D AB, AE, BC, BF, CD 
F,D G AB, AE, BC, BF, CD, CG 
F,D,G F A, B, E, C, F 
D,G D A,B, E, C, F, D 
G G A, B, E, C, F, D, G 


The resulting BFS order of visitation is returned in the list 
L=(A, B, E, C, F, D, G), and the resulting BFS spanning tree 
(Figure 15.30) is returned in the set T= {AB, AE, BC, BF, CD, 


CD}. 


The depth-first search algorithm uses a stack instead of 


a queue. 


Table 15.1 Trace of Algorithm 15.2 


Figure 15.30 Tracing the BFS 


Algorithm 15.3 The Depth-First Search (DFS) Algorithm 

(Preconditions: G = (VE) is a graph or digraph with initial vertex vo; each vertex has a boolean 
visited field initialized to false; 7 is an empty set of edges; L is an empty list of vertices.) 
(Postcondition: L lists the vertices in DFS order, and 7 is a DFS spanning tree for G.) 


lie 


20 Oy eit 


Initialize an empty stack S for temporary storage of vertices. 
Add v, to L. 

Push vy, onto S. 

Mark y visited. 

Repeat steps 6-8 while S is not empty. 

Let x be the top element on S. 

If x has any adjacent unvisited vertices, do steps 9-13. 
Otherwise, pop the stack S and go back to step 5. 
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9. Let y be an unvisited vertex that is adjacent to x. 
10. Add the edge xy to 7. 
11. Addy to ZL. 
12. Push y onto S. 
13. Mark y visited. 


EXAMPLE 15.27 Tracing the DFS Algorithm 


Table 15.2 shows a trace of Algorithm 15.3 on the same graph (Figure 15.29) as in Example 15.26. The 
start vertex is V) =A. 


L ) x y T 
A A A B AB 
A,B A,B 5) Cc AB, BC 
A, B, C A, B,C Cc D AB, BC, CD 
A, B, C, D A, B, C, D D G AB, BC, CD, DG 
A, B,C, D, G A,B,C,D,G G 
A, B, C, D D 
A,B,C Cc F AB, BC, CD, DG, CF 
A, B, C, D, G, F A, B, C, F F 
A, B,C Cc 
A,B B E AB, BC, CD, DG, CF, BE 
A,B,C,D,G,F,E A,B,E E 
A,B B 
A A 


Table 15.2 Trace of Algorithm 15.3 


The resulting DFS order of visitation is returned in the list 
L= (A, B, C, D, G F, E). Figure 15.31 shows the resulting 
DFS spanning tree, which is returned in the set 7 = {AB, BC, 
CD, DG, CF, BE}. 


Figure 15.31 Tracing the DFS 


Since the depth-first traversal uses a stack, it has a 
natural recursive version: 


Algorithm 15.4 The Recursive Depth-First Search (DFS) Algorithm 
(Preconditions: G = (V,E) is a graph or digraph with initial vertex x; each vertex has a boolean 
visited field initialized to false; Tis a global set of edges; L is a global list of vertices.) 
(Postcondition: L lists the vertices in DFS order, and 7 is a DFS spanning tree for G.) 

1. Mark x visited. 

Add x to L. 

3. Repeat steps 4-5 for each unvisited vertex y that is adjacent to x. 
Add the edge xy to T. 
Apply the DFS algorithm to the subgraph with initial vertex y. 


We 
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EXAMPLE 15.28 Tracing the Recursive DFS Algorithm 


Table 15.3 shows a trace of Algorithm 15.4 on the same graph as in Example 15.26 (Figure 15.29 on 
page 301). The start vertex Vv) =A. 
The result, of course, is the same as that in Example 15.27. The only real difference is that the explicit 


L x y T 
A A B AB 
A,B B C AB, BC 
A,B,C C D AB, BC, CD 
A,B, C, D D G AB, BC, CD, DG 
A,B, C, D, G G 
D 
C F AB, BC, CD, DG, CF 
A, B, C, D, G, F F 
C 
B E AB, BC, CD, DG, CF, BE 
A, B, C, D, GF, E E 
B 
A 


Table 15.3 Trace of Algorithm 15.4 


stack S has been replaced by the system stack that keeps track of the recursive calls. 
EXAMPLE 15.29 Implementing the Graph Traversal Algorithms 


Here is a Java implementation of the two traversal algorithms for the Network class introduced in 
Example 15.25 on page 299: 
1 public class Network { 


2 Vertex start; 

3 

4 private class Vertex { 

5 Object object; 

6 Edge edges; 

7 Vertex nextVertex; 

8 boolean visited; 

9 } 

10 

11 private class Edge { 

12 Vertex to; 

13 int weight; 

14 Edge nextEdge; 

15 } 

16 

17 public static void visit(Vertex x) { 
18 System.out.printIn(x.object) ; 


19 } 
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21 public void breadthFirstSearch() { 
22 if (start == null) { 

23 return; 

24 } 

25 Vector queue = new Vector(); 

26 visit(start); 

27 start.visited = true; 

28 queue.addElement(start) ; 

29 while (!queue.isEmpty()) { 

30 Vertex v = queue. firstElement() ; 
31 queue. removeElementAt (0) ; 

32 for (Edge e = v.edges; e != null; e = e.nextEdge) { 
33 Vertex w = e.to; 

34 if C!w.visited) { 

35 visit); 

36 w.visited = true; 

37 queue. addElement (w) ; 

38 } 

39 } 

40 } 

41 } 

42 

43 public void depthFirstSearch() { 

44 if (start != null) { 

45 depthFirstSearch(start) ; 

46 } 

47 } 

48 

49 public void depthFirstSearch(Vertex x) { 
50 visit(x); 

51 X.visited = true; 

52 for (Edge e = x.edges; e != null; e = e.nextEdge) { 
53 Vertex w = e.to; 

54 if Clw.visited) { 

55 depthFirstSearch(w) ; 

56 } 

57 } 

58 } 

59 } 


This uses the recursive version of the depth-first search. That requires the depthFirstSearch() 
method with zero parameters to start the recursive depthFirstSearch() method. 


Review Questions 


15.1. What is the difference between a graph and a simple graph? 
15.2. In an undirected graph, can an edge itself be a path? 
15.3. What is the difference between connected vertices and adjacent vertices? 


15.4 Using only the definition of graph isomorphism, is it easier to prove that two graphs are iso- 
morphic or to prove that two graphs are not isomorphic? Why? 


15.5 Are the five conditions in Theorem 15.4 on page 289 sufficient for two graphs to be isomor- 
phic? 
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15.7 


15.8 


15.9 


15.1 


15.2 
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Why is it that the natural definition of a simple graph prohibits loops while the natural defini- 
tion of a digraph allows them? 


True or false: 


a. 
b. 
c. 
d 

e. 


f. 
g. 


If a graph has n vertices and n(n—1)/2 edges, then it must be a complete graph. 

The length of a path must be less than the size of the graph. 

The length of a cycle must equal the number of distinct vertices it has. 

If the incidence matrix for a graph has n rows and n(m—1)/2 columns, then the graph 
must be a complete graph. 

In an incidence matrix for a digraph, the sum of the entries in each row equals the inde- 
gree for that vertex. 

The sum of all the entries in an incidence matrix for a graph is 2|F]. 

The sum of all the entries in an incidence matrix for a digraph is always 0. 


A graph (JV, E) is called dense if |E| = @(|V|’), and it is called sparse if |E| = O(\V)). 


a. 


b. 


Which of the three representations (adjacency matrix, incidence matrix, or adjacency 
list) would be best for a dense graph? 
Which representation would be best for a sparse graph? 


Why is it that, in the incidence matrix of a simple graph, there are always exactly two Is in 
each column? 


Problems 


Find each of the following properties for the graph shown c 


in Figure 15.32: 


Fon Fe me ao oS 


Its size n f 
Its vertex set V 

Its edge set F d 

The degree d(x) of each vertex x Figure 15.32 A graph 
A path of length 3 

A path of length 5 

A cycle of length 4 

A spanning tree 

Its adjacency matrix 

Its incidence matrix 

Its adjacency list 


Find each of the following properties for the digraph y c 
shown in Figure 15.33: a 


Smo oc Bo Ff 


Its size n 

Its vertex set V d e 
Its edge set FE 

The indegree id(x) of each vertex x 
The outdegree od(x) of each vertex x 
A path of length 3 

A path of length 5 

A cycle of length 4 


Figure 15.33 A digraph 
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i. Its adjacency matrix 
j- Its incidence matrix 
k. Its adjacency list 


15.3. Draw the complete graph on n vertices for n = 2, 3, 4, 5, and 6. 
15.4 Determine whether the graph G, in Figure 15.34 is either eulerian or hamiltonian. 


15.5 Determine whether the graph G, in Figure 15.34 is either eulerian or hamiltonian. 


G, | G, 


Figure 15.34 Two graphs 


15.6 Figure 15.35 shows 12 subgraphs of the graph in Figure 15.6 on page 288. Determine 
whether each of these is connected, acyclic, and/or spanning. 


ia. 
6. 


Figure 15.35 Twelve graphs 


L\ 


/ 
% 
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Figure 15.35 (continued) Twelve graphs 
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15.7 __ Find two nonisomorphic graphs for which all five conditions of Theorem 15.4 on page 289 
are true. 


15.8 Describe the adjacency matrix for the complete graph on v vertices. 
15.9 Describe the incidence matrix for the complete graph on n vertices. 


15.10 Let G, be the graph represented by the adjacency list shown in 


Figure 15.36: : 
a. Draw G,. C 5 
b. Is G, a directed graph? D AL—IB 
c. Is G, strongly connected? E C D 
d. Is G, weakly connected? F E 
e. Is G, acyclic? Figure 15.36 Adjacency list 
f. Give the adjacency matrix for G,. 

15.11 Let G, be the graph whose adjacency matrix is shown in Figure 15.37: 

a. Draw G,. PEE 
b. Is G,a simple graph? Skene 
c. Is G, a directed graph? 01010 
d. Is G, strongly connected? 11101 
e. Is G, weakly connected? 01101 


f. Is G, acyclic? 


15.12 Let G, be the weighted digraph shown in Figure 15.38: 
a. Draw the adjacency matrix for this graph. 
b. Draw the adjacency list for this graph. 
c. Is this graph connected? Justify your answer. 
d. Is this graph acyclic? Justify your answer. 


Figure 15.37 A matrix 


ior 


Figure 15.38 A digraph 


15.13. A wheel graph on n vertices is a graph of size n+1 consisting 
of a n-cycle in which each of the 7 vertices is also adjacent to 
a single common center vertex. For example, the graph shown 
in Figure 15.39 is the wheel graph on six vertices. 
Describe: 
a. The adjacency matrix of a wheel graph on 7 vertices 
b. The incidence matrix of a wheel graph on n vertices 
c. The adjacency list of a wheel graph on n vertices Higure 1539 A-graph 
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15.14 Let G, and G, be the graphs shown in Figure 15.40: 
a. Determine whether G, and G, are isomorphic. Justify your conclusion. 
b. Either find an euler cycle for G, or explain why it has none. 
c. Either find a hamiltonian cycle for G, or explain why it has none. 
s G 


Figure 15.40 Two graphs 


15.15 Trace Dijkstra’s algorithm (Algorithm 15.1 on page 297) on the graph in Figure 15.41, show- 
ing the shortest path and its distance from node A to every other node. 


As 
4 2 
Fe 5 B 
2 1 
1 ie 4 
3 \ 2 
E« 4 Cc 
3 1 
De 
Figure 15.41 A weighted graph Figure 15.42 A weighted graph 


15.16 Trace Dijkstra’s algorithm on the graph in Figure 15.42, showing the shortest path and its dis- 
tance from node A to every other node. 


15.17 There are four standard algorithms for traversing binary trees: the preorder traversal, the 
inorder traversal, the postorder traversal. and the level-order traversal. If a binary tree is 
regarded as a connected acyclic graph, which tree traversal results from a: 

a. Depth-first search 
b. Breadth-first search 

15.18 Determine which of the graphs in Figure 15.43 on page 310 are isomorphic. Note that all 

seven graphs have size 10. 
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G, 


es Ge 
Gs 
e 2 o 
G, @ e e e e 
e e e e 
e es o 
e e e e e 


Figure 15.43 Seven graphs 


15.19 For the weighted digraph G, shown in Figure 15.44 on page 311. 
a. Draw the adjacency matrix. 
b. Draw the adjacency list. 


15.20 Perform the indicated traversal algorithm on the graph shown in Figure 15.45 on page 311. 
Give the order of the vertices visited and show the resulting spanning tree: 
a. Trace the breadth-first search starting at node A. 
b. Trace the depth-first search of the graph, starting at node A and printing the label of 
each node when it is visited. 
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15.1 
15.2 


15.3 


15.4 


15.5 


15.6 


15.7 


15.8 


Figure 15.44 A digraph Figure 15.45 A graph 


Answers to Review Questions 


A graph is simple if it has no loops or repeated edges. 


No: In an undirected graph, an edge cannot be a path because an edge is a set of two elements (i.¢., an 
unordered pair) while a path is a sequence (i.e., an ordered list of vertices). 


Two vertices are connected if there is a path from one to the other. Two vertices are adjacent if they 
form an edge. 


Using only the definition of graph isomorphism, it is easier to prove that two graphs are isomorphic 
because it only requires finding an isomorphism and verifying that it is one. Proving from the defini- 
tion that two graphs are not isomorphic would require verifying that every one of the m! one-to-one 
functions is not an isomorphism. 


No: The five conditions of are not sufficient for two graphs to be isomorphic. It is possible for all five 
conditions to be true for two nonisomorphic graphs. (See Problem 15.7.) 


The reason that the natural definition of a graph prohibits loops is that an edge in a graph is a two-ele- 
ment set, and that requires the two elements to be different. In the natural definition of a digraph, an 
edge is an ordered pair, and that allows both components to be the same. 
a. True 
True 
True 
True 
False 


momo te 


True 
g. True 


The adjacency matrix is best for a dense graph because it is compact and provides fast direct access. 
The adjacency list is best for a sparse graph because it allows easy insertion and deletion of edges. 


212 


15.9 


15.1 


15.2 
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There must be exactly two Is in each column of an incidence matrix of a simple graph because each 
column represents a unique edge of the graph, and each edge is incident upon exactly two distinct ver- 


tices. 


For Fe mes ae oP 


For Fe ms ae op 


Solutions to Problems 


n=6. 

V = {a, b,c, d, e, fi. 

E = {ab, bc, bd, cd, ce, de, cf, df}. 

d(a) = 1, d(b) = 3, d(e) = d(f) = 2, d(c) = d(d) = 4. 
The path abcd has length 3. 

The path abcfde has length 5. 

The cycle bcedb has length 4. 

A spanning tree is shown in Figure 15.46. 

Its adjacency matrix is shown in Figure 15.47. 
Its incidence matrix is shown in Figure 15.47. 
Its adjacency list is shown in Figure 15.47. 


Figure 15.46 Spanning tree 


a bedef 
a\F\|tl|FlFlFl|F a b 
S|) F\/F|T|/T)F/E 
c/F/TtTl/Fi]ri|r|t I 
d|F/T/TI|F/T|T 7 ub £ d 
e/F/F|tT|tT|F|F 

FI\F\|T/|T/FI|F 
f c B d é if 
al1|/o]o|o|ojojo|}o d b c e if 
blia}1aj1alolojojojo 
c}o}/1}/o};ia}alajojo 
djo}o}ji}a}ojojaija S c d 
e}o}ojojlo;1i]/oj}i1jo 

o|/ololoj/oj1ijoja 
f Lf c d 


Figure 15.47 Adjacency matrix, incidence matrix, and adjacency list 


n=6. 

V = {a, b,c, d, e, fr. 

E = {ad, ba, bd, cb, cd, ce, cf, de, ec, fe}. 

id(a) = id(b) = id(c) = id(f) = 1, id(d) = id(e) = 3. 
od(a) = od(d) = od(e) = od(f) 
The path adec has length 3. 
The path fecbad has length 5. 
The cycle adcba has length 4. 
A spanning tree is shown in Figure 15.48. 

Its adjacency matrix is shown in Figure 15.49. 
Its incidence matrix is shown in Figure 15.49. 


1, od(b) = 2, od(c) = 4. 


De c 
a 
a eo 
d e 


Figure 15.48 Spanning tree 
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l. its adjacency list is shown in Figure 15.49. 
ao Boog 4a ef 


a|F|F|F/T|F/F a d 
b|T\|F/EF/T|FI/F 

Col ok a ea ua 

dq|/F\/F|F/F|T|F b il d 
e|F|F/|T/F|F/F 

f\F\F/F/F/T/F c B d @ Ff 
alil-1}o]o]/o}lolojlojojo d e 
b)o}1]1}-1}0}/0]/o0]};o0]o]}]o 
c/oO}o}o;1]/1)/1]11]0}-1] 0 

d\-1| 0 }-1} 0 }-1)0]/o0]1]o0]0 La € 
e|/0}0}]0]0]0 J-1] 0 |-1] 1 }-12 

f\o;}o0};o]o0];0}o0 }-1})/0}0}1 f zB 


Figure 15.49 Adjacency matrix, incidence matrix, and adjacency list 


15.3. The complete graphs are shown in Figure 15.50: 


_ARWY 


Figure 15.50 Complete graphs 


15.4 The graph G, cannot be eulerian because it has odd degree vertices. But the hamiltonian cycle shown 
in Figure 15.51 on page 314 verifies that it is hamiltonian. 


15.5 The graph G; is neither eulerian nor hamiltonian. 


15.6 a. Disconnected, cyclic, and spanning. 
b. Disconnected, acyclic, and spanning. 
c. Disconnected, cyclic, and spanning. 
d. Disconnected, cyclic, and not spanning. 
e. Connected, acyclic, and spanning. 
f. Connected, acyclic, and spanning. 
g. Connected, cyclic, and spanning. 
h. Connected, acyclic, and not spanning. 
i. Disconnected, cyclic, and spanning. 
j- Connected, cyclic, and not spanning. 
k. Disconnected, acyclic, and not spanning. 
1. Connected, acyclic, and not spanning. 


15.7. The two graphs shown in Figure 15.52 on page 314 are not isomorphic because the one on the left has 
a 4-cycle containing two vertices of degree 2 and the one on the right does not. Yet, all five conditions 
of Theorem 15.4 on page 289 are satisfied. 
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Figure 15.51 Hamiltonian cycle Figure 15.52 Nonisomorphic graphs 


15.8 The adjacency matrix for the complete graph on 7 vertices is an n-by-n boolean matrix with false 
value at each entry on the diagonal and true value at every other entry. 


15.9 The incidence matrices M,, for the complete graphs on n vertices are as follows: 
Its has m rows and n(n—1)/2 columns (see Theorem 15.2 on page 286). 
If n = 2, it is the 2-by-1 matrix containing true in both entries. 
If n > 2, it is the matrix A concatenated horizontally with the matrix obtained from M,,, by 
placing one row of all false values on top of it. 
The four matrices are shown in Figure 15.53. 


15.10 a. The digraph is shown in Figure 15.54. 

b. Yes, this is a digraph: It has at least one one-way edge. 

c. No, the digraph is not strongly connected: There is no path from C to D. 

d. Yes, the digraph is weakly connected: Its embedded (undirected) graph is connected. 
e. No, the digraph is not acyclic: It contains the cycle AFEDA. 

f 


Its adjacency matrix is shown in Figure 15.55 on page 315. 


15.11 a. The digraph G; is shown in Figure 15.56 on page 315. 

b. Yes it is a digraph: Its adjacency matrix is not symmetric. 
c. No, this is not a simple digraph because it has a loop. 

d. Yes, this digraph is strongly connected. 

e. Yes, this digraph is weakly connected. 

f. 


No, the digraph is not acyclic: It contains the cycle ADB. 


M, = 


Figure 15.53 Incidence matrices Figure 15.54 Digraph 
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000001 
001000 
010000 
110000 
001100 
1000010 


Figure 15.55 Incidence matrix Figure 15.56 Digraph 


The adjacency matrix is shown in Figure 15.57. 

The adjacency list is shown in Figure 15.57. 

The graph is not connected because there is no path from B to A. 
The graph is not acyclic because it contains the cycle BECDB. 


A 3|B 4/C 1|D 
03410 
100 0 2 B 2|E 
00 00 0 3 0 
o23 nm Cc 3|/D 
00 4 mm 
D 2|B 3\|C 
E 4|C 


Figure 15.57 Adjacency matrix and adjacency list 


The adjacency matrix for a wheel graph looks like matrix A shown in Figure 15.58 on page 316. 
The incidence matrix for a wheel graph looks like matrix B shown in Figure 15.58 (for the case 
n =A). In general, it will have n 1s followed by 7 Os on the first row. Below that will lie the iden- 
tity matrix (all 1s on the diagonal and 0s elsewhere) followed by the square matrix with 1s on 
the diagonal and the subdiagonal. Compare this with the recursive solution to Problem 15.9 on 
page 308. 

The adjacency list for a wheel graph looks like the list shown in Figure 15.58 on page 316. The 
edge list for the first vertex (the central vertex) has m edge nodes, one for every other vertex. 
Every other edge list has three edge nodes: one pointing to the central vertex (labeled a in Figure 
15.58) and one to each of its neighbors. 

The two graphs are isomorphic. The bijection is defined by the vertex labels shown in Figure 
15.59 on page 316. 

An euler cycle for G, is ABCDEBFCADFEA. 

A hamiltonian cycle for G, is ABCDFEA. 


The trace of Dijkstra’s algorithm is shown in Figure 15.60 on page 316. 


The trace of Dijkstra’s algorithm is shown in Figure 15.61 on page 316. 


a. 
b. 


If the depth-first search is applied to a tree, it does a preorder traversal. 
If the breadth-first search is applied to a tree, it does a level-order traversal. 
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Figure 15.58 Adjacency matrix and adjacency list 
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Figure 15.59 Isomorphic graphs 


“le 


Figure 15.60 Dijkstra’s algorithm Figure 15.61 Dijkstra’s algorithm 
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15.18 The seven graphs are labeled in Figure 15.62. Among them: 


G, is isomorphic to G, : The isomorphism is shown by the vertex labels a—/. 

G; is isomorphic to G, : The isomorphism is shown by the vertex labels p—y. 

G. cannot be isomorphic to any of the other graphs because it has 25 edges and all the others 
have 20. 

G; (and thus also G,) cannot be isomorphic to any of the other graphs because it has a pyramid 
of four adjacent 3-cycles (pqr, prs, pst, and ptq) and none of the other graphs (except G,) does. 
G. cannot be isomorphic to any of the other graphs because it has a chain of three adjacent 4- 
cycles (ABCD, CEFG, and FHI) and none of the other graphs (except G,) does. 

Similarly, G; cannot be isomorphic to any of the other graphs because it has a chain of four 
adjacent 3-cycles (POS, OSR, SRT, and RTU) and none of the other graphs (except G,) does. 
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Figure 15.62 Graph isomorphisms 


15.19 The adjacency matrix and the adjacency list are shown in Figure 15.63 on page 318. 


15.20 a. 


b. 


The breadth-first search visits ABDECHFIGKLJMONPQ,; its spanning tree is shown on the left 
in Figure 15.64 on page 318. 
The depth-first search visits ABCFEIHDKLMJGNPQO; its spanning tree is shown on the right 
in Figure 15.64 on page 318. 
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Figure 15.64 Breadth-first search and depth-first search 
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Essential Mathematics 


This appendix summarizes mathematical topics used in the study of data structures. 
THE FLOOR AND CEILING FUNCTIONS 


The floor and ceiling functions return the two nearest integers of a given real number. The 
floor of x, denoted by Lx], is the greatest integer that is not greater than x. The ceiling of x, 
denoted by | x |, is the smallest integer that is not smaller than x. 

Here are the main properties of these two functions. (The symbol Z stands for the set of all 
integers.) 


Theorem A.1 Properties of the Floor and Ceiling Functions 
1. |x| =max{meZ|m<x}, and [x] =min{neZ|n>x}. 
meen <x <|x|t+il,and[x]—l<x<[x]. 
Sel < |x| <x<[x]<x+1. 
4 IfneZandn<x<n+1,thenn=([x]|.IfneZandn—1<x<n,thenn=[x]. 
5. IfxeZ, then |x| =x =[x]. 
6. Ifx¢Z, then |x| <x < [x]. 
7. \x)=—-|-x |and [-x] =-|x]. 
Seeeee! |= |x| +land[x+1]=[x]+1. 


LOGARITHMS 


The /ogarithm with base b of a positive number x is the exponent y on b for which bY = x. For 
example, the logarithm of 1000 base 10 is 3 because 10° = 1000. This is written log,, 1000 = 3. 

The logarithm with base 2 is called the binary logarithm and is written lgx = log, x. For 
example, lg 8 = 3. 

As a mathematical function, the logarithm is the inverse of the exponential function with the 
same base: 

y=log,x @& bY =x 

For example, 3 = lg 8 because 23=8. 
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Theorem A.2 Laws of Logarithms 
1. log,(b”) =y 
f lo8ox =x 
log, uv = log, u + log,v 
log, u/v = log, u — log,v 
log, u” = v log, u 
log, x = (log.x)/(log, b) = (log, c) (log. x) 
7. For a positive integer n, [lg(n+1)]=Llgn| +1. 


DN Se 3D 


EXAMPLE A.1 Applying the Laws of Logarithms 


log, 256 = log, (2°) = 8 

log, 1000 = (log 19 1000)/(log jo 2) = 3/0.30103 = 9.966 

log, 1,000,000,000,000 = log, 1000* = 4 (log, 1000) = 4(9.966) = 39.86 
Unn)/gn) = dog.n)/(log,n) = log, 2 = In2 = 0.693147, for any n> 1 


ASYMPTOTIC COMPLEXITY CLASSES 


In computer science, algorithms are classified by their complexity functions. These are 
functions that describe an algorithm’s running time relative to the size of the problem. For 
example, the Bubble Sort belongs to the complexity class Or’). This means that if the Bubble 
Sort takes T milliseconds to sort an array of 7 elements, then it will take about 47 milliseconds to 
sort an array of 2n elements because (27)? = 4n?. 

The symbol ©() is one of five symbols used to describe complexity functions. They all can be 
defined in terms of the ratios of f(z) and g(7), where f{(7) is the algorithm’s timing function and 
g(n) is a characterizing function such as lg or n“. For a given function g(n), the five asymptotic 
complexity classes are 

O(g(n)) = { f(”) €S | f(n)/g(n) is bounded } 
X(g(n)) = { f() €S| g(n)/f(n) is bounded } 
O(g(n)) = { f(n) €S | f(2)/g(n) is bounded and g(n)/f(n) is bounded } 
o(g(1)) = { fi) €S | finig(n) > Oasn— & } 
o(g(n)) = { fm) €S| gnif(n) > 0 as n> © } 
These definitions assume that /(7) and g(7) that are positive ascending functions. 
As sets of functions, 
o(g) < O(g) 
o(g) € Ag) 
O(g) = O(g) 9 QA) 


EXAMPLE A.2 Asymptotic Growth Classes 


For every k> 0, n* = 0(2”), because n‘/2” > 0. 

For every k > 0, (Ign) = 0(n), because (ign)“/n = 0. 

For every base 5 > 1, log, = O(/gn), because log, n/lgn = log, 2. 
The factorial numbers 7! = (2”), because 2”/n! > 0. 
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The five complexity classes can be imprecisely described by these phrases: 
fin) = 0(g(7)) means that f(”) grows more slowly than g(7). 
fin) = O(g(n)) means that f(n) grows more slowly or at the same rate as g(7). 
fin) = O(g(n)) means that f(n) grows at the same rate g(7). 
Kn) = Q(g(n)) means that f(z) grows faster or at the same rate as (7). 
Kin) = &(g(n)) means that f(7) grows faster than g(7). 


EXAMPLE A.3 Asymptotic Growth Classes 


250 lgn = o(), because 250 lg grows more slowly than zn. 
0.086 n lgn = w(n), because 0.086 lgn grows faster than n. 


Keep in mind that these functions f(7), g(7), and so on, are usually used to describe how long 
it takes to run an algorithm. So if (7) grows “more slowly” than g(v), then the algorithm with 
complexity /(7) is generally faster than the algorithm with complexity g(7). Less time is better. 


THE FIRST PRINCIPLE OF MATHEMATICAL INDUCTION 


The First Principle of Mathematical Induction, also called “weak induction,” is often used to 
prove formulas about positive integers. 


Theorem A.3 The First Principle of Mathematical Induction 
If {P,, P, P3,... } 18 a sequence of statements such that: 

* P, is true. 

* Each statement P,, can be deduced from its predecessor P,, 1. 
Then all of the statements P,, Py, P3,... are true. 


EXAMPLE A.4 Weak Induction 


Suppose we want to prove that the inequality 2” < (n + 1)! is true for every n => 1. Then the sequence of 

statements is 
Pe “ohao 
Py. = 273! 
P;: =<! 
etc. 

The first few statements can be verified explicitly: 

2'=2<2=2! 
27=4<6=3! 
23=8<24=4! 

In particular, P; is true, satisfying the first of the two requirements for weak induction. This is called 
the base of the induction. 

To verify the second requirement, we have to show that each statement P,, can be deduced from its 
predecessor P,,_;. So we examine the two general statements P,, , and P,, and look for a connec- 
tion: 

Pigs Oe en! 
Pye 2" <(n+1)! 

To derive P,, from P,,_;, we note that 2” = (2)(2”"!) and (n + 1)! =(n + 1)(n!). Thus, if we assume that 
P,,_1 is true, then we have 2” = (HQ) <Q)! <(+ Da!) =(7+ I)!, because n + 1 > 2. 

Verifying the second requirement of mathematical induction is called the inductive step. 
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THE SECOND PRINCIPLE OF MATHEMATICAL INDUCTION 


The Second Principle of Mathematical Induction, also called “strong induction,” is nearly the 
same as the first principle. The only difference is in the inductive step. 


Theorem A.4 The Second Principle of Mathematical Induction 
If {P), P, P3,... } 18 a sequence of statements such that: 


* P, is true. 
* Each statement P,, can be deduced from its predecessors {P), P>, P3,..., P,_1}- 
Then all of the statements P,, Py, P3,... are true. 


So to verify the inductive step with strong induction, we may assume that all n—1 statements 
Pig PoP cas ook pay are Ue. 


EXAMPLE A.5 Strong Induction 


Prove that the Fibonacci numbers 0, 1, 1, 2, 3, 5, 8, 13, 21, . . . are asymptotically exponential. More 
precisely, we prove that F,, = O(2”), where the Fibonacci numbers F’, are defined as Fy = 0, F; = 1, 
and F,, = F,_| + F,_2. So our sequence of statements is 


Py: Fy < oe 
Po: Fy < Oh 
P3: P; < 23 
etc. 
These first few are true because of the following relationships: 
Py: Fy =1<2 


Po: fF, =2<4 
P3: F3=3<8 


For the inductive step, we assume that n—1 statements P,, Py, P3,..., P,,, are true and compare 
them with the mth statement P,: 

Py: Fy <2! 

Py: Fy <2? 

P33 Fy <2? 

Rise Bae 

Pea Fae 

Py Feo: 


Comparing the nth statement with the two that precede it, we see that 
By = Fo + Fo 
2 = OO" DF lyn lyn 149” 2 

So we can derive P,, from P,,_; and P,,_> like this: 


Fa Fa + ge re 


This proves that all the statements are true (i.e., F,, < 2” for all 7). 
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GEOMETRIC SERIES 


A series is a sequence of possibly infinitely many terms whose sum is to be determined. A 
geometric series is a series in which each term is the same multiple of its predecessor. For 
example, 20 + 60 + 180 + 540 + 1620 + 4860 + --- is a geometric series because each term is 3 
times the size of its predecessor. The multiplier 3 is called the common ratio of the series. 


Theorem A.5 Sum of a Finite Geometric Series 

Ifr 4 1, then 

n-1 _ a(1—r") 
Jer 


2 3 
ararvrar tar r:::T ar 


Here, a is the first term in the series, r is the common ratio, and n is the number of terms in the 
series. 


EXAMPLE A.6 Finite Geometric Series 


For the sum 20 + 60 + 180 + 540 + 1620 + 4860, the three parameters are a = 20, r= 3, and n= 6. So 
the sum is 


l-r 1-3 —2 


Theorem A.6 Sum of an Infinite Geometric Series 
If—1<r< 1, then 


EXAMPLE A.7 Infinite Geometric Series 


For the sum 0.42 + 0.0042 + 0.000042 + 0.00000042 + 0.0000000042 + - - -, the three parameters are 
a=0.42 and r= 0.01. So the infinite sum is 


a 042 042 42 14 


i-r 1-001 099 99 33 


Note that 14/33 = 0.4242424242 - . -. This repeating decimal is obviously the same as the infinite sum 
0.42 + 0.0042 + 0.000042 + 0.00000042 + 0.0000000042 + - - «. 


OTHER SUMMATION FORMULAS 


Theorem A.7 Sum of the First 1 Positive Integers 
n(n+ 1) 
2 


Note that the parameter 7 equals the number of terms in the sum. 


1+24+3+--4+n = 


EXAMPLE A.8 Summing Positive Integers 


The sum of the first 10 integers is 1 +2+3+4+4+5+4+6+7+8+4+9+4 10= 10(10+1)/2 =55. 
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Theorem A.8 Sum of the First n Squares 
- + 
Vaeeyge t= n(n “Mee 1) 
The expression on the right appears to be a fraction. But it will always turn out to be an integer 


because it equals a sum of integers. 


EXAMPLE A.9 Summing Squares 


The sum of the first six squares is 17 + 2? + 3% + 47 + 57 + 67 = 6(7)(13)/6 = 546/6 = 91. 


HARMONIC NUMBERS 


n H,, 

The harmonic series is the series of reciprocals: A pieces 

a 2 1.500000 
doce griged eth daa Tals: 3 1.833333 
Mik 23 4 5 : 

; a ; 4 2.083333 
It is not hard to see that this series diverges. That is, its partial ? ere 
sums increase without bound. ; Screneng 

The partial sums of the harmonic series are called the 4 See uoRSS 
harmonic numbers and are denoted by H,,: 3 Sa Gees 

H, 1 eee ae ed aeil aoa cael 9 2.828968 
mk 2 3 4 5 n 10 2.928968 
The first three harmonic numbers are 
I Table A.1 Harmonic numbers 
1 
He= 25 =a 


Din 


3 
1 11 
H, Dog I+5+3 


Although the harmonic numbers increase without bound, it is not obvious how fast they 
increase. Table A.1 suggests that they increase very slowly. 


The fact is that the harmonic numbers increase logarithmi- a nl 

cally: H, = O(ign). This means that they increase at about the : 
ones ‘ 0 i 

same rate as logarithmic numbers. More precisely, it means that 

both ratios H,,/Ign and lgn/H,, are bounded. ; : 
3 6 
STIRLING’S FORMULA ‘i i 
5 120 
The factorial numbers frequently appear in the analysis of 6 720 
algorithms. They are defined by: 7 5040 
8 40,320 
n! = T]& = (1)(2)3)(4)---(2) : es ae 


k=1 


The first ten factorials are shown in Table A.2. Table A.2 Factorial numbers 
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Unlike the harmonic sequence, the factorial sequence grows exponentially. This is reflected by 
Stirling s formula: 


nl = J3nn(3) 0/2", where 0<0 <1 


The value of the variable 9 depends upon 7, but in any case it is bounded between 0 and 1. Thus, 
for large n, the exponent 0/12n will be very close to 0, making the factor gon very close to 1. 
Consequently, Stirling’s formula is often expressed in this simpler approximate form: 


n 
n 
~ i 
n!\ 2nt (2) 


The factorial numbers grow exponentially: n! = Q(2”). This fact follows from Stirling’s 
formula. 

Another important consequence of Stirling’s formula is that lg(!) is asymptotically equivalent 
to nlgn: lg(n!) = O(n l|gn). 


FIBONACCI NUMBERS 7 
n n 
The Fibonacci numbers also frequently appear in the analysis of : : 
algorithms. They are defined by: : ; 
2 1 
0,ifn=0 3 2 
ale an OR ee | 4 3 
F,_,+F,_»ifn>1 5 5 
The first 13 Fibonacci numbers are shown in Table A.3. i : 
: : , 7 13 
Like the factorial sequence, the Fibonacci sequence grows 
exponentially, as is verified by De Moivre s formula: i ee 
9] 34 
pe whee el Ss ways tee 10 55 
Ss ; : 11 89 
Thus, Ff, = O(”). Here, = 1.618034 and y = —0.618034. These 12| 144 


two constants are the golden mean and its conjugate. 
Table A.3 Fibonacci numbers 


Review Questions 


A.l_ A function /(Q is called idempotent if f({(x)) = f(x) for all x in the domain of /(). Explain why 
the floor and ceiling functions are idempotent. 


A.2. What is a logarithm? 

A.3 What is the difference between weak induction and strong induction? 
A.4 How can you decide when to use strong induction? 

A.5 What is Euler’s constant? 

A.6 What makes Stirling’s formula useful? 
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A.l 
A.2 
A.3 


A.4 
AS5 
A.6 
A.7 


A.l 


A.2 
A3 


A4 


A.5S 


A.6 


A.l 
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Problems 


Prove Theorem A.1 on page 319. 
Prove Theorem A.2 on page 320. 


True or false: 
a. f=o(g) @ g=a/(f) 
b. f= Og) & g=QP/) 


ec f=O(g) > g=O(/) 

d. f=O(g) > f= Og) 

e. f= O(g) > f= Qe) 

f. f=O(h) \g=O(h) > f+ g=O(h) 
g. f=O() A g=O() = fg=O() 
h. n?=O(nlgn) 

i. n?=O(nl1gn) 

j. 2? =O lgn) 

k. lgn=Q@(n) 

Ll Ign=o(n) 


Prove Theorem A.5 on page 323. 
Prove Theorem A.6 on page 323. 
Prove Theorem A.7 on page 323. 


Run a program that tests De Moivre’s formula on page 325 by comparing the values obtained 
from it with those obtained from the recursive definition of the Fibonacci numbers. 


Answers to Review Questions 


The floor and ceiling functions are idempotent because they return integer values, and according to 
Theorem A.1 on page 319, the floor or ceiling of an integer is itself. 


A logarithm is an exponent. It is the exponent on the given base that produces the given value. 


The First Principle of Mathematical Induction (“weak” induction) allows the inductive hypothesis that 
assumes that the proposition P(7) is true for some single value of n. The Second Principle of Mathe- 
matical Induction (“‘strong” induction) allows the inductive hypothesis that assumes that all the propo- 
sitions P(x) are true for all & less than or equal to some value of n. 


Use weak induction (the first principle) when the proposition P(7) can be directly related to its prede- 
cessor P(n—1). Use strong induction (the second principle) when the proposition P(7) depends upon 
P(x) fork < n-1. 


Euler's constant is the limit of the difference (1 + 1/2 + 1/3 +...+ I/n) — Inn. Its value is approxi- 
mately 0.5772. 


Stirling’s formula is a useful method for approximating n! for large n (e.g., n > 20). 


Solutions to Problems 


Proof of Theorem A.1 on page 319: 
a. The relationships Lx] = max{meZ|m <x}, and [x] = min{neZ | n = x} are merely 
restatements of the definitions of | x | and [x]. 
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A.2 


A.3 


A.4 


A.5 


A.6 


b. Let m= |x | andn=[ x ]. Then by definition, m<x<m+1andn—1<x<n.Thenx—-1l<m 
andn<x+1.Thusx-—l<m<x<n<xt+l. 

c. The inequalities x — 1 < |x| < x < [x ]<x+ 1 merely summarize those in b above. 

d. LetneZ such thatn <x<n+1,andletdA={meZ|m <x}. Thenne A and Lx | =maxA, 
son < |x]. Now ifn< |x], thenn+1 < |x] (since both n and [x | are integers). But n + 1 
< |x |, by hypothesis. Therefore, m = | x |. The proof of the second part is analogous. 

e. Assume that x eZ (1.e., x is an integer). Then let n =x in d above: x <x<x+1sox= 
|x | andx-—1<x<xsox=[x]. 

f. Assume that x ¢ Z (i.e., x is not an integer). Let w=x-—|x | andv=|x]—x. Then, byc, u > 0 
and v > 0. Also by e,x—1<|x |=x-—wandv+x=[x]<x+1,sou<1andv<1.Thus0 < u 
<land0 < v<1. But wand v cannot be integers because if either were, then so would x because 
x=|x|+u=|x]-v. Therefore, 0<u<1and0<v<l,sox=|x|t+u>|x|andx=[x]-v 
<[x]. 

g. Let n=-—|-x]|. Then (7) =| (—x) |so by c, (x) -—1< (Cn) < (—), sox <n<xt+l1,sox<n 
andn—1<x,son—1<x < n. Thus by d,n=[x |. Thus—|-x |. =| x], so |-x | =—|x ]. The 
second identity follows from the first by replacing x with —x. 

h. Letn=[x+1].Thenbyc, (x+1)-—l<n< («+1),sox-l<n-1<xandx=(«+1)-1< 
n. Thus, n—1 < x <n; that is, (7-1) < x<(m—1) +1. Thus by d, (n— 1) =|-x |, so [x +1 ]= 
n=|.x |+ 1. The proof of the second identity is similar. 


Proof of Theorem A.2 on page 320: 
a. Let x =b». Then by definition, log,(b”) = log,(x) = y. 
b. Let y = log,x. Then by definition, b !°8* = bY = x. 
c. Let y=log,u and z = log, v. Then by definition, u = b” and v = b?, so uv = (b”)(b7) = b”*7, so 
log,(uv) =y +z=log,u + log, v. 
By Law c above, log, v + log, u/v = log,(v u/v) = log, u, so log, u/v = log, u + log, u. 
Let y = log, u. Then by definition, w= 5”, so u” = (b”)” = 6”. Then by definition, log,(u”) = vy 
=v log,u. 
Let y = log, x. Then by definition, x = b’, so log.x = log.(b”) = y log.b, by Law e above. Thus 
log, x =y = (log,.x)/(log, 5). 
True 
True 
True 
False 
True 
True 
False 
False 
False 
True 
False 
True 


Proof of Theorem A.5 on page 323: , ; ; ; 

Let S=atart+ar tar +::+ar".ThenrS=ar+ar t+ar +ar ++ +ar",soS—rs 
=a-ar",(1-r)S=a(1-r”), and thus S=a(1 -—r” VU. - 7). 

Proof of Theorem A.6 on page 323: 

If—1 <r< 1, then as nv increases without bound, 7” shrinks down to zero. Let r” =0 in the formula 
in Theorem A.5. 

Proof of Theorem A.7 on page 323: 

Let S 1+2+3+-+:-+n. Then S =n+-:-+3+2+1 also. Add these two equations, 
summing the 27 terms on the right pairwise: (1 + 7), (2 + (7-1)), etc. There are n pairs, and each pair 
has the same sum of 7 + 1. So the total sum on the right is n(7 + 1). Then, since the sum on the left is 
2S, the correct value of S must be n(n + 1)/2. 


° & 


™ 
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A.7 Program to test the De Moivre’s formula for Fibonacci numbers: 
public class Fibonacci { 
public static void main(String[] args) { 

final double SQRT5 = Math.sqrt(5.0); 

final double PHI = (1 + SQRT5)/2; 

final double PSI = (1 - SQRT5)/2; 
long fO, fl = 0, f2 = 1; 
< 


for Cint n = 2; n 32; n++) f{ 
fO = fl; 
fl = f2; 
f2 = f1 + f0; 


double fn = (Math.pow(PHI, n) - Math.pow(PSI, n))/SQRT5; 
System.out.print("%4d%12.2", f2, fn); 


Abstract data type, 3 
abstract modifier, 8 
AbstractList class, 69 
Access 

package, 9 

private, 19 

protected, 19 

public, 19 
Accessor method, 7, 106 
Ackermann function, 178 
Acyclic graph, 288 
Adel’son-Velskii, G.M., 237 
Adjacency list, 291, 293 
Adjacency matrix, 290, 293 
Aggregation, 10 
Algorithm: 

binary search, 31, 46, 168, 241 

bubble sort, 256, 320 

bucket sort, 271 

Dijkstra, 297 

Euclidean, 171 

heap sort, 265 

insertion sort, 258 

merge sort, 260 

quick sort, 263 

radix sort, 270 

selection sort, 257 

shell sort, 259 
Almost complete binary tree, 207 
Amortized constant time, 86 
API, 16 
Array, 26 

associative, 148 

generic, 106 
Array component, 26, 33 
Array duplication, 28 
Array element, 26, 33 
ArrayDeque class, 69 
ArrayIndexOutOfBoundsException, 26, 33 
ArrayList, 100 
ArrayList class, 69 
asListQ method, 96 
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assert statement, 106 
Associative array, 148 
Autoboxing, 97 

AVL tree, 237, 240, 241 


Babbage primes, 36 
Bag, 72 
Basis of recursive algorithm, 166 
Behavior of an object, 3 
Bidirectional iterator, 87 
BIFO container, 247 
Binary logarithm, 319 
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